[GE users] job suspension based on queue priority
reuti at staff.uni-marburg.de
Wed Nov 5 14:58:48 GMT 2008
On 05.11.2008 at 15:42, Jigar Halani wrote:
> I am facing an issue in my environment; the details are below.
> I would appreciate any help.
> I have 100 systems in total: 50 are configured under small.q
> (small jobs run on these systems), 30 are configured under
> medium.q (medium jobs run on these systems) and 20 are configured
> under big.q (large jobs run on these systems). I am using
> SGE 6.1u5.
You mean every system has only one queue on it?
> · I have to configure the queues based on a time limit, e.g.
> small.q should only allow jobs to execute for 1 minute. I have set
> a CPU limit of 1 minute. It works, but it kills the job, whereas the
> user wants the job to be suspended. Is there any option to do that?
You can suspend jobs, but when there is only one queue on each system,
it hardly makes sense. What would be your condition to unsuspend the job?
> · Second, I also have no clue how to configure job suspension
> based on queue priority. E.g. let's say I have 10 licenses of an
> application A and I am currently running 10 jobs in big.q. Now a
> user submits a job in small.q, so SGE should suspend the last job
> submitted in big.q and use its license for the job submitted in
> small.q.
This is not possible in SGE without the help of a co-scheduler. As
all licenses are used up, SGE wouldn't schedule any additional job,
especially as suspended jobs also still hold their resources. The
co-scheduler would need to suspend a job (which must also agree to give
the license back) and adjust the license count in SGE.
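
A minimal sketch of the decision logic such a co-scheduler might use,
in Python. Everything here is an assumption for illustration: the Job
record, the consumable name "app_a", and the idea that the license is a
consumable defined in complex_values of the global host. The commands
are only built as argument lists, not executed; check qmod(1) and
qconf(1) before using anything like this, and note that the application
itself still has to release the license while suspended.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Job:
    """Hypothetical job record; a real co-scheduler would fill it from qstat output."""
    job_id: int
    queue: str
    submit_time: float  # seconds since epoch
    state: str          # "r" = running, "s" = suspended


def pick_victim(jobs: List[Job], low_priority_queue: str = "big.q") -> Optional[Job]:
    """Return the most recently submitted running job in the low-priority queue."""
    running = [j for j in jobs if j.queue == low_priority_queue and j.state == "r"]
    return max(running, key=lambda j: j.submit_time, default=None)


def preemption_commands(victim: Job, license_name: str, total_licenses: int) -> List[List[str]]:
    """Build the commands to suspend the victim and free one license slot.

    A suspended job still counts against the consumable, so the sketch
    raises the configured count by one while the job is suspended.
    Assumes the consumable lives in complex_values of the global host.
    """
    return [
        ["qmod", "-sj", str(victim.job_id)],
        ["qconf", "-mattr", "exechost", "complex_values",
         f"{license_name}={total_licenses + 1}", "global"],
    ]


def resume_commands(victim: Job, license_name: str, total_licenses: int) -> List[List[str]]:
    """Build the commands to restore the license count and unsuspend the victim."""
    return [
        ["qconf", "-mattr", "exechost", "complex_values",
         f"{license_name}={total_licenses}", "global"],
        ["qmod", "-usj", str(victim.job_id)],
    ]
```

When the small.q job has finished, the co-scheduler would issue the
resume commands, so the license count is set back before the big.q job
continues.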
> Once the small.q job is over (as the job will not run for more than
> 1 minute, users are OK with suspending big.q jobs), the resource
> (license) will be given back to the suspended job. I have gone
> through the thread
> http://gridengine.sunsource.net/ds/viewMessage.do?dsForumId=38&dsMessageId=72816
> but have not found a solution.
> Awaiting your reply, and thanks in advance.
> Thanks and regards,
> Jigar Halani
> Talentain Technologies Pvt. Ltd.