[GE users] complexes - debugging internal count

Stephan Grell - Sun Germany - SSG - Software Engineer stephan.grell at sun.com
Tue Mar 22 07:42:56 GMT 2005



Olesen, Mark wrote:

>I am seeing somewhat sporadic behaviour with my complexes (licenses) in that
>a second job sometimes starts when there *should* not be any complexes left.
>I'm being to suspect that my external load adjuster may be the root, but I'd
>like to debug the internal complex usage as jobs start/stop.
>  
>
Sounds, as if it must have to do something with load values as 
consumables.....

>I'm still using 6.0u1 - is there a means of obtaining this internal
>information in the meantime, or is parsing the qstat output still the only
>means?
>  
>
Hm.. you could use the monitor file. It states all the resource used by 
the running jobs:

21722:1:STARTING:1111477231:87000:G:global:test:3.000000
21722:1:STARTING:1111477231:87000:Q:test1.q at scrabe.workgroup:slots:1.000000
::::::::
21722:1:RUNNING:1111477231:87000:G:global:test:3.000000
21722:1:RUNNING:1111477231:87000:Q:test1.q at scrabe.workgroup:slots:1.000000
::::::::
21722:1:RUNNING:1111477231:87000:G:global:test:3.000000
21722:1:RUNNING:1111477231:87000:Q:test1.q at scrabe.workgroup:slots:1.000000
::::::::
21722:1:RUNNING:1111477231:87000:G:global:test:3.000000
21722:1:RUNNING:1111477231:87000:Q:test1.q at scrabe.workgroup:slots:1.000000

Or of course qstat -F...

You find the monitor setting at: $SGE_ROOT/$SGE_CELL/common/schedule. The
monitoring is enabled via qconf -msconf / params monitor=1

Stephan

>/mark
>
>---------------------------------------------------------------------
>To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
>For additional commands, e-mail: users-help at gridengine.sunsource.net
>
>  
>


---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
For additional commands, e-mail: users-help at gridengine.sunsource.net




More information about the gridengine-users mailing list