[GE users] Parallel job being allocated slots in different queues

robhorton r.horton at qmul.ac.uk
Mon Jan 18 16:11:44 GMT 2010


Hi,

On Mon, 2010-01-18 at 13:55 +0100, reuti wrote:
> > When the job was deleted and resubmitted it was scheduled as I would
> > expect. I've not seen anything similar happen before (the setup hasn't
> > changed for around six months). I'm running 6.1u6.
> >
> > Has anyone seen this before?
> 
> yes, this is the normal behavior. SGE just collects slots from all  
> eligible queues; maybe in former times your h_rt request was always  
> in a range which does not fit into the normal queue. When you don't  
> like behavior, you will have to define two PEs and bind only one to  
> each queue. Once SGE selected a PE for a job, it will stay in this PE  
> for sure. Best is to use a naming like "mype" and "mype_long", as you  
> can then just specify: qusb -pe "mype*" ...

Ah thanks - I hadn't realised that.

I guess the real problem is that it has selected two queue instances
which subordinate each other so both end up suspended.

> It's already an RFE that a parallel job should stay in a queue and/or  
> hostgroup.

It's here if anyone's interested:
http://gridengine.sunsource.net/issues/show_bug.cgi?id=1311

Rob

------------------------------------------------------
http://gridengine.sunsource.net/ds/viewMessage.do?dsForumId=38&dsMessageId=239559

To unsubscribe from this discussion, e-mail: [users-unsubscribe at gridengine.sunsource.net].



More information about the gridengine-users mailing list