[GE users] controlling jobs on failed nodes

rayson rayrayson at gmail.com
Thu Aug 13 00:15:59 BST 2009

  sge_conf(5) -- reschedule_unknown
  queue_conf(5) -- rerun


On 8/12/09, snosov <serge.nosov2 at gmail.com> wrote:
> Hi,
> I was wondering if there was a way to make GE terminate/reschedule a job, if
> a node that this job was running on does not respond for a specified period
> of time. Currently, the setup that I have with 6.1u5 is that if a node goes
> down while in the middle of running a job, this job stays in "running" state
> forever.
> Thank you,
> Serge.


To unsubscribe from this discussion, e-mail: [users-unsubscribe at gridengine.sunsource.net].

More information about the gridengine-users mailing list