[GE users] Kill Jobs that appear to be doing nothing

cgull matt.mcnally at virgin.net
Mon Jan 11 11:01:39 GMT 2010


We recently had a couple of parallel jobs that stopped running. But the job hung and did not finish correctly. All the nodes related to this job once the job hung then had a load average of 0.00. Is there anyway functionality in SGE that if nodes are idle for a length of time say two hours and that they should have a job running on them. That the job on the machine would be killed, or a notification message sent?

Thanks for your help.

------------------------------------------------------
http://gridengine.sunsource.net/ds/viewMessage.do?dsForumId=38&dsMessageId=238061

To unsubscribe from this discussion, e-mail: [users-unsubscribe at gridengine.sunsource.net].



More information about the gridengine-users mailing list