[GE users] SGE Loading

craffi dag at sonsorol.org
Thu May 27 11:13:09 BST 2010

Behavior as you describe is unusual enough to imply that the previous 
admin has made some configuration changes ( I think ).

By default the SGE scheduler will send your job to the "least busy" 
system among the entire set of nodes that are able to satisfy all of the 
job requirements. On clusters with identical hardware you would 
generally see the job dispersal/scattering that you are seeking.

There are a few things that could be happening:

- You have nodes that are in unusual states and thus are not really able 
to accept work

- someone has altered the load_adjust_threshold to be too low

- someone has swiched from load based queue instance sorting to "seqno"

If you posted the output of these commands the list might be able to 
make concrete suggestions:

(1) show the current state of your cluster via "qstat -f"

(2) dump the qmaster config via "qconf -sconf"

(3) show the scheduler config via "qconf -ssconf"

OOPS - forgot to ask a real question. Are these parallel jobs? If they 
are parallel jobs and someone has configured the PE with an allocation 
rule of $fill_up than you would see the job "packing" that you describe. 
With parallel jobs you have a bit more control within the PE object as 
to how jobs are scattered across machines.


sli1que wrote:
> I have a SGE setup that I have taken over administration. Currently, I don't see jobs being evenly disbursted among a specific cluster. ie cluster A has 10 identical computers and if I submit 10 jobs, I would expect each computer to get assigned 1 job. I actually get about 6 or 7 on one machine and the other will go to other machines. Can anyone help me to understand what I need to do or where I need to look?
> Thanks
> in Advance.


To unsubscribe from this discussion, e-mail: [users-unsubscribe at gridengine.sunsource.net].

More information about the gridengine-users mailing list