[GE users] "h_rt" or "s_rt" for predicting job end times
dan.templeton at sun.com
Thu Feb 12 14:44:14 GMT 2009
[ The following text is in the "UTF-8" character set. ]
[ Your display is set for the "ISO-8859-10" character set. ]
[ Some characters may be displayed incorrectly. ]
h_rt -> SIGKILL when limit exceeded
s_rt -> SIGUSR1 when limit exceeded, SIGKILL <notify> seconds later
h_cpu -> SIGKILL when limit exceeded -- RACE: OS may send SIGXCPU first
s_cpu -> SIGXCPU when limit exceeded -- Used with h_cpu to send a warning
> We?re using Grid Engine 6.1 and need some help deciding which out of
> "h_rt" and "s_rt" our jobs should be using in order to help the
> scheduler predict when jobs will finish.
> When I posted recently about our reservation problems, Reuti suggested
> I look into using ?h_rt?. Unfortunately the Admin and User PDF guides
> don?t contain any information on either "h_rt" or "s_rt", so I had to
> experiment to find out what it does.
> From my experiments, it appears that ?h_rt? sets a run time per job,
> which is used by the scheduler to predict when jobs finish.
> Unfortunately, it causes jobs to be terminated if they run for longer
> than this specified time. I?m guessing that ?h_? stands for a hard
> limit and this is why jobs are terminated when then exceed this?
> I?m guessing that ?s_rt? is a soft limit? I?m hoping that this means
> that once the time specified by the job is reached, that it does ?NOT?
> terminate the job? i.e. if the user specified the wrong time limit by
> accident, or the job ran slower for some reason, that the job would be
> allowed to continue running?
> Does anyone know if ?s_rt? is also used by the scheduler in the same
> way that ?h_rt? is used and if the only difference would be that one
> terminates and the other doesn?t?
> Sorry for all these questions but I can?t seem to find any
> documentation on these two settings. If anyone can point me at some
> documentation it would be really appreciated.
> Many thanks,
To unsubscribe from this discussion, e-mail: [users-unsubscribe at gridengine.sunsource.net].
More information about the gridengine-users