[GE users] Preemption vs dedicating nodes by group?

Reuti reuti at staff.uni-marburg.de
Wed May 4 22:08:18 BST 2005


    [ The following text is in the "ISO-8859-1" character set. ]
    [ Your display is set for the "ISO-8859-10" character set.  ]
    [ Some special characters may be displayed incorrectly. ]

Quoting Jim Marconnet <jmarconnet at knology.net>:

<snip>
> Also, do you have an email, a memo, a readme.txt file, a webpage, or
> whatever you use now or you will use to educate/instruct your users so they
> know how it all works and what they must do and what they must not do to
> make this scheme work well for everyone that you could share? Having their
> job unexpectedly killed and restarted from the beginning after several
> days,
> weeks, or even months run time could be an unpleasant surprise! And
> unexpected, unexplained file corruption would be an even worse! Perhaps you
> just name your secondary queue "run_jobs_here_at_your_own_peril.q".
> 
> If they figure out that if they simply and conveniently leave off the -ckpt
> flag in their qsub command, it could be dangerous to the overall scheme.
> Who
> wants their jobs to be (voluntarily) killed and restarted if they can
> conveniently "leave off" that mysterious alphabet-soup flag and thus be
> immune from automatic kill/restart?

You could check in the queue prolog (or maybe better: starter_method) whether 
$SGE_CKPT_ENV is set - if not, don't run the job. - Reuti 

---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
For additional commands, e-mail: users-help at gridengine.sunsource.net




More information about the gridengine-users mailing list