[GE users] controlling jobs on failed nodes

templedf dan.templeton at sun.com
Thu Aug 13 14:26:54 BST 2009


Look at the reschedule_unknown setting in the global host configuration 
(sge_conf(5)).
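
As a rough sketch of what that could look like (the 10-minute value is only an illustrative assumption, not a recommendation): the parameter takes a time in hh:mm:ss format and is edited with "qconf -mconf" for the global configuration, or "qconf -mconf <hostname>" for a single host. Per sge_conf(5), only jobs that are marked rerunnable (for example, submitted with "qsub -r y" or running in a queue whose "rerun" attribute is TRUE) are rescheduled, so it is worth checking that as well.

```
# Fragment of the cluster configuration as shown by "qconf -mconf".
# A host counts as "unknown" once its execd has not reported
# for longer than max_unheard.
# 00:00:00 (the default) disables rescheduling; 00:10:00 is an example value.
reschedule_unknown           00:10:00
```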

Daniel

snosov wrote:
> Hi,
>
> I was wondering whether there is a way to make GE terminate or reschedule
> a job if the node it is running on does not respond for a specified
> period of time. With my current 6.1u5 setup, if a node goes down in the
> middle of running a job, the job stays in the "running" state forever.
>
> Thank you,
> Serge.

------------------------------------------------------
http://gridengine.sunsource.net/ds/viewMessage.do?dsForumId=38&dsMessageId=212133
