[GE users] au state

Robert Griffiths Robert.Griffiths at mitsubishi-sec-intl.com
Wed May 11 13:32:37 BST 2005


Hi Martyn,

You could try qstat -j with the ID of the pending job (1931 in your
listing) to see what the scheduler thinks is wrong. It should say
something like

Scheduling info:	queue instance "compute-0-1.q" dropped because it is temporarily not available

That wording is from memory - I don't have any machines in that state at
the moment, but I have seen it before.

Also, when I've had similar problems where jobs could not be scheduled, it
has been a sign that the queue master had become unresponsive or died.
You could try "bouncing" (restarting) the grid daemon - that might help.

I'm not sure whether the version you're running affects how useful this
is, but we've got a 5.3p6 grid and a 6.0.4 grid and the advice applies to
both.
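
For checking this from a script, here is a minimal Python sketch that picks
out queue instances flagged a (load alarm) or u (unknown, i.e. the qmaster
cannot reach the execution daemon) from qstat -f output. It assumes the
column layout shown in the qstat -f listing quoted below; the function name
and the regular expression are illustrative assumptions, not part of Grid
Engine, and other versions may format the listing differently.

```python
import re

# Sample taken from the qstat -f listing quoted in this thread
# (only one job line kept for brevity).
SAMPLE = """\
queuename            qtype used/tot. load_avg arch       states
----------------------------------------------------------------------------
32-bit-compute-0-0.q BIP   0/2       0.00     glinux
----------------------------------------------------------------------------
compute-0-0.q        BIP   2/2       2.00     lx24-amd64
   1929     0 HCl_H2O_2+ victorm      r     05/11/2005 11:35:38 MASTER
----------------------------------------------------------------------------
compute-0-1.q        BIP   0/2       99.99    lx24-amd64 au
----------------------------------------------------------------------------
compute-0-2.q        BIP   0/2       0.00     lx24-amd64
"""

# Assumed layout of a queue line: name, qtype, used/total, load, arch and an
# optional states column. Header, separator and job lines do not match.
QUEUE_LINE = re.compile(
    r"^(?P<queue>\S+)\s+(?P<qtype>[A-Z]+)\s+(?P<used>\d+)/(?P<total>\d+)\s+"
    r"(?P<load>\S+)\s+(?P<arch>\S+)(?:\s+(?P<states>\S+))?\s*$"
)


def unavailable_queues(qstat_text):
    """Return {queue_name: states} for queue instances flagged a or u."""
    flagged = {}
    for line in qstat_text.splitlines():
        match = QUEUE_LINE.match(line)
        if match is None:
            continue  # not a queue line
        states = match.group("states") or ""
        if "a" in states or "u" in states:
            flagged[match.group("queue")] = states
    return flagged


print(unavailable_queues(SAMPLE))  # prints {'compute-0-1.q': 'au'}
```

In practice the text would come from running qstat -f on the cluster rather
than from a hard-coded sample.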

Hope it helps!

Rob

-----Original Message-----
From: Wheeler, Dr M.D. [mailto:mdw10 at leicester.ac.uk] 
Sent: 11 May 2005 13:26
To: users at gridengine.sunsource.net
Subject: [GE users] au state


Dear all,
One of the nodes of my cluster has been removed for repair as it crashed
last week, and SGE now shows this node as being in the au state.  The
problem is that I cannot submit jobs to another node (even though it is
shown as free) while the cluster is in this state.

# qstat -f
queuename            qtype used/tot. load_avg arch       states
----------------------------------------------------------------------------
32-bit-compute-0-0.q BIP   0/2       0.00     glinux
----------------------------------------------------------------------------
compute-0-0.q        BIP   2/2       2.00     lx24-amd64
   1929     0 HCl_H2O_2+ victorm      r     05/11/2005 11:35:38 MASTER
            0 HCl_H2O_2+ victorm      r     05/11/2005 11:35:38 SLAVE
            0 HCl_H2O_2+ victorm      r     05/11/2005 11:35:38 SLAVE
----------------------------------------------------------------------------
compute-0-1.q        BIP   0/2       99.99    lx24-amd64 au
----------------------------------------------------------------------------
compute-0-2.q        BIP   0/2       0.00     lx24-amd64

############################################################################
 - PENDING JOBS - PENDING JOBS - PENDING JOBS - PENDING JOBS - PENDING JOBS
############################################################################
   1931     0 formal     nantakorn    qw    05/11/2005 13:08:15


Does anyone have any ideas?

Thanks in advance,
Martyn

