[GE users] Eqw because of exit_status 100

SLIM H.A. h.a.slim at durham.ac.uk
Tue Jun 26 14:43:05 BST 2007


Dear all

If a program exits with code 100, SGE marks the job as being in error
("E") state and tries to rerun it. Some of our users run an application
that returns 100 as a general error code, but there is no reason to
rerun the program with the same input; it simply failed. qacct -j gives
these lines:

failed       30  : rescheduling on application error
exit_status  100                 

The users submit these jobs in bulk, causing qstat to produce an
unnecessarily long list of jobs in error.

Is there a way to avoid this (other than the application not using 100
as an exit code)?
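
To make the idea concrete, here is a minimal sketch of one workaround I
can imagine: a wrapper that runs the application and maps exit status 100
to a different non-zero code, so that SGE never sees it. This is an
assumption on my part, not a confirmed SGE feature. The script name
no100.py, the helper names, and the fallback code 1 are all hypothetical.
Status 99 is also remapped because Grid Engine treats it as a request to
requeue the job.

```python
#!/usr/bin/env python3
"""Hypothetical wrapper: run a command, but never exit with 99 or 100."""
import subprocess
import sys

# Exit codes that Grid Engine treats specially
# (99 = requeue the job, 100 = put the job in error state).
RESERVED = {99, 100}
# Assumption: 1 is an acceptable generic failure code for the application.
FALLBACK = 1


def remap_exit_status(status, reserved=RESERVED, fallback=FALLBACK):
    """Return `fallback` for reserved statuses, otherwise the status unchanged."""
    return fallback if status in reserved else status


def run_wrapped(argv):
    """Run argv as a child process and return the remapped exit status."""
    completed = subprocess.run(argv)
    status = completed.returncode
    if status < 0:
        # Child was killed by signal N (returncode is -N);
        # report 128 + N, as a shell would.
        status = 128 - status
    return remap_exit_status(status)


# Only act as a wrapper when a command was actually given.
if __name__ == "__main__" and len(sys.argv) > 1:
    sys.exit(run_wrapped(sys.argv[1:]))
```

The job script would then call something like
"python3 no100.py ./myapp input.dat" (program and file names made up)
instead of calling the application directly. The drawback is that every
job script has to be changed, and the original exit status is lost unless
the wrapper also logs it.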

Thanks,
Henk
