[GE users] Eqw because of exit_status 100

Chris Dagdigian dag at sonsorol.org
Tue Jun 26 14:46:49 BST 2007


Check out the parameter FORBID_APPERROR=TRUE in the manpage for  
sge_conf -- that should do the trick.

Regards,
Chris




On Jun 26, 2007, at 9:43 AM, SLIM H.A. wrote:

>
> Dear all
>
> If a program exits with code 100 SGE marks the job as being in error
> ("E") state and tries to rerun it. Some of our users run an  
> application
> that returns 100 as a general error state but there is no reason to
> rerun the program with the same input, it simply failed. qacct -j  
> gives
> these lines:
>
> failed       30  : rescheduling on application error
> exit_status  100
>
> The users submit these jobs in bulk causing qstat to produce an
> unnecessary lengthy list of jobs in error.
>
> Is there a way to avoid this (other than the application not using 100
> as an exit code)?
>
> Thanks Henk
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
> For additional commands, e-mail: users-help at gridengine.sunsource.net
>

--
Chris Dagdigian  <dag at sonsorol.org>
Current coordinates: Boston-area, USA
GPS: http://bioteam.net/dagbin/gps?42.385693+N+71.115535+W



---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
For additional commands, e-mail: users-help at gridengine.sunsource.net




More information about the gridengine-users mailing list