[GE users] Jobs being suspended incorrectly
reuti
reuti at staff.uni-marburg.de
Fri Apr 16 22:45:01 BST 2010
Am 16.04.2010 um 23:30 schrieb opoplawski:
> On 04/16/2010 03:23 PM, opoplawski wrote:
>> On 04/07/2010 08:29 AM, reuti wrote:
>>>
>>> can you please post the queue definitions.
>>
>> Okay, here we go:
>>
>> $ qstat -u \* | grep apapane
>> 16680 0.56000 run_cora.c dombroski S 04/16/2010 15:01:34
>> mpi at apapane.cora.nwra.com 8
>>
>> $ qstat -f | grep apapane
>> admin.q at apapane.cora.nwra.com BIPC 0/0/1 0.03 lx26-
>> amd64
>> ivm.q at apapane.cora.nwra.com BIPC 0/0/4 0.03 lx26-
>> amd64
>> compute.q at apapane.cora.nwra.co BIPC 0/4/4 0.03 lx26-
>> amd64 S
>> mpi at apapane.cora.nwra.com PC 0/4/4 0.03 lx26-
>> amd64
>>
>> Why does compute.q at apapane show 4 slots used?
>> Why is the job in S when it is in the mpi queue?
>
> Okay, I figured this out - this is an 8 cpu job - it put 4 in
> compute.q at mpi and 4 in mpi at apapane. Hmmm, how to I avoid this?
It's necessary not to attach the same PE to more than one queue -
otherwise you can get a mix of slots from various queues. You can use
a similar name for the two PEs and request a PE by its starting
letters followed by a wildcard. Once a PE was selected, it will stay
in this PE and collect slots from this one only.
-- Reuti
>
> --
> Orion Poplawski
> Technical Manager 303-415-9701 x222
> NWRA/CoRA Division FAX: 303-415-9702
> 3380 Mitchell Lane orion at cora.nwra.com
> Boulder, CO 80301 http://www.cora.nwra.com
>
> ------------------------------------------------------
> http://gridengine.sunsource.net/ds/viewMessage.do?dsForumId=38&dsMessageId=253747
>
> To unsubscribe from this discussion, e-mail: [users-unsubscribe at gridengine.sunsource.net
> ].
------------------------------------------------------
http://gridengine.sunsource.net/ds/viewMessage.do?dsForumId=38&dsMessageId=253751
To unsubscribe from this discussion, e-mail: [users-unsubscribe at gridengine.sunsource.net].
More information about the gridengine-users
mailing list