[GE users] qhost MEMUSE

reuti reuti at staff.uni-marburg.de
Tue Feb 17 18:07:51 GMT 2009


Am 17.02.2009 um 18:14 schrieb futuritymmx:

> Yes we are experiencing jobs not being able to reserve memory.  At  
> such
> times the physical and swap memory appears to have been totally  
> used up.
>
> Thanks to your last response about the difference between "free -m"  
> value
> and "qhost" value, it appears that when there is free memory it may  
> be used
> by buffers and caches, but when the processes require all the  
> memory that
> these buffers and caches disappear as expected.
>
> I'm just trying to track down which users are submitting the  
> largest memory
> jobs so that they can provide accurate "vf" values to qsub.  As you  
> say, you
> have to track down the sum of all the memory usage by all the process
> created by each job.

You made vf consumable? Another option is to use h_vmem in a similar  
manner. Difference is, that h_vmem will be enforced, hence the jobs  
being killed if they consume too much memory. vf is only a guidance.


-- Reuit

> Scary task!
>
> Neil
>
> -----Original Message-----
> From: reuti [mailto:reuti at staff.uni-marburg.de]
> Sent: 17 February 2009 13:31
> To: users at gridengine.sunsource.net
> Subject: Re: [GE users] qhost MEMUSE
>
> Hi,
>
> Am 16.02.2009 um 20:43 schrieb futurity:
>
>> Thanks Reuti.
>>
>> Is there any easy way to gather job memory usage?
>
> well you could sum up in a script the consumption of all processes
> belonging to the sgeexecd. Do you need this to get the information
> about used memory by local interactive usage of a workstation outside
> of SGE?
>
> -- Reuti
>
>
>> Regards
>>
>> Neil
>>
>> -----Original Message-----
>> From: reuti [mailto:reuti at staff.uni-marburg.de]
>> Sent: 16 February 2009 17:38
>> To: users at gridengine.sunsource.net
>> Subject: Re: [GE users] qhost MEMUSE
>>
>> Hi,
>>
>> Am 16.02.2009 um 18:22 schrieb futurity:
>>
>>> I was wondering if the MEMUSE value returned by "qhost" represents
>>> the
>>> memory used by all processes on a machine, or just the memory  
>>> used by
>>> grid jobs running on it?
>>
>> It's from all processes on a node. Just the output you get also
>> from a command like:
>>
>> $ free -m
>>
>> (or -g) next to "+/- buffers". I.e. a system information. Otherwise
>> the output should read zero in an empty cluster.
>>
>> -- Reuti
>>
>>
>>> Regards
>>>
>>> Neil
>>
>> ------------------------------------------------------
>> http://gridengine.sunsource.net/ds/viewMessage.do?
>> dsForumId=38&dsMessageId=107424
>>
>> To unsubscribe from this discussion, e-mail: [users-
>> unsubscribe at gridengine.sunsource.net].
>>
>> ------------------------------------------------------
>> http://gridengine.sunsource.net/ds/viewMessage.do?
>> dsForumId=38&dsMessageId=107497
>>
>> To unsubscribe from this discussion, e-mail: [users-
>> unsubscribe at gridengine.sunsource.net].
>>
>
> ------------------------------------------------------
> http://gridengine.sunsource.net/ds/viewMessage.do? 
> dsForumId=38&dsMessageId=1
> 08123
>
> To unsubscribe from this discussion, e-mail:
> [users-unsubscribe at gridengine.sunsource.net].
>
> ------------------------------------------------------
> http://gridengine.sunsource.net/ds/viewMessage.do? 
> dsForumId=38&dsMessageId=108261
>
> To unsubscribe from this discussion, e-mail: [users- 
> unsubscribe at gridengine.sunsource.net].

------------------------------------------------------
http://gridengine.sunsource.net/ds/viewMessage.do?dsForumId=38&dsMessageId=108290

To unsubscribe from this discussion, e-mail: [users-unsubscribe at gridengine.sunsource.net].



More information about the gridengine-users mailing list