[GE users] Is this a bug?

reuti reuti at staff.uni-marburg.de
Tue Dec 9 16:43:18 GMT 2008


On 09.12.2008 at 16:58, Aaron Turner wrote:

> reuti wrote:
>> Then it will set h_vmem=0, not infinity; you can of course specify
>> h_vmem=infinity. As one would expect, the application will crash with
>> zero memory.
>
> Is perhaps the failure mode of the application somehow dragging down
> the queue? The error states on the queues can be cleared relatively
> easily, but it does block new jobs until the point that this has been
> done.

What exactly is stated in the messages file
($SGE_ROOT/default/common/qmaster/messages)? Just "queue xy was put
into error state because of job xy's failure"?

If I submit a job with h_vmem=0, the job is killed immediately by SGE,
because its startup alone already uses too much memory.
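
To compare both cases and look at the queue states afterwards, the
commands involved could be put together roughly like this. It is only
a minimal sketch: the script name job.sh and the queue instance
all.q@node01 are hypothetical placeholders, and the helper function
names are made up for illustration. The commands are only printed,
not executed.

```python
import shlex


def qsub_cmd(script, h_vmem):
    """Build a qsub command line requesting a hard virtual memory limit."""
    return ["qsub", "-l", f"h_vmem={h_vmem}", script]


def explain_errors_cmd():
    """Build a qstat command line that shows why queues are in error state (E)."""
    return ["qstat", "-f", "-explain", "E"]


def clear_error_cmd(queue):
    """Build a qmod command line that clears the error state of a queue."""
    return ["qmod", "-c", queue]


if __name__ == "__main__":
    # job.sh and all.q@node01 are hypothetical; replace with real names.
    commands = [
        qsub_cmd("job.sh", "0"),         # expected to be killed at startup
        qsub_cmd("job.sh", "infinity"),  # no hard limit on virtual memory
        explain_errors_cmd(),
        clear_error_cmd("all.q@node01"),
    ]
    for cmd in commands:
        # Print only; run the lines by hand on a submit host.
        print(shlex.join(cmd))
```

If a queue only shows state E after the h_vmem=0 submission, the
-explain E output together with the messages file should tell why.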

-- Reuti


>> But I can't see queues getting into error state after this. Which
>> version of SGE are you using on which platform?
>
> We are running on Scientific Linux (4.5) on Opterons, and we had this
> issue under 6.1u4 and it still happens with 6.2
>
> Regards,
>
>    Aaron Turner
>
> ------------------------------------------------------
> http://gridengine.sunsource.net/ds/viewMessage.do?dsForumId=38&dsMessageId=91964
>
> To unsubscribe from this discussion, e-mail: [users-unsubscribe at gridengine.sunsource.net].

------------------------------------------------------
http://gridengine.sunsource.net/ds/viewMessage.do?dsForumId=38&dsMessageId=91969
