[GE users] SGE6 does not backfill

Reuti reuti at staff.uni-marburg.de
Sun Apr 10 17:51:51 BST 2005


    [ The following text is in the "ISO-8859-1" character set. ]
    [ Your display is set for the "ISO-8859-10" character set.  ]
    [ Some special characters may be displayed incorrectly. ]

I just saw backfilling to happen for me.

The jobs still running have a h_rt set, and the new submitted serial ones also?

I wonder, why all of your queues are dropped. qstat -f shows that all are empty 
besides the three running ones?

CU - Reuti

Quoting Juha Jäykkä <juhaj at iki.fi>:

> > what is "qconf -tsm" giving in the
> > /usr/sge/default/common/schedd_runlog?
> 
> Here it is. This looks quite ok to me, but of course I do not know how it
> should happen. All the queues should be dropped, that is for sure, since
> the first job in the queue reserves them. There are jobs, however, that
> should be eligible for backfilling.
> 
> 
> Sun Apr 10 18:34:53 2005|-------------START-SCHEDULER-RUN-------------
> Sun Apr 10 18:34:53 2005|queue instance "all.q at topaasi.local" dropped because
> it is full
> Sun Apr 10 18:34:53 2005|queues dropped because they are full:
> all.q at topaasi.local
> Sun Apr 10 18:34:53 2005|Job 182 cannot run because available slots combined
> under PE "lam" are not in range of job
> Sun Apr 10 18:34:53 2005|queue instance "all.q at compute-0-6.local" dropped
> because it is full
> Sun Apr 10 18:34:53 2005|queue instance "all.q at compute-0-10.local" dropped
> because it is full
> Sun Apr 10 18:34:53 2005|queue instance "all.q at compute-0-8.local" dropped
> because it is full
> Sun Apr 10 18:34:53 2005|queue instance "all.q at compute-0-3.local" dropped
> because it is full
> Sun Apr 10 18:34:53 2005|queue instance "all.q at compute-0-9.local" dropped
> because it is full
> Sun Apr 10 18:34:53 2005|queue instance "all.q at compute-0-2.local" dropped
> because it is full
> Sun Apr 10 18:34:53 2005|queue instance "all.q at compute-0-0.local" dropped
> because it is full
> Sun Apr 10 18:34:53 2005|queue instance "all.q at compute-0-4.local" dropped
> because it is full
> Sun Apr 10 18:34:53 2005|queue instance "all.q at compute-0-5.local" dropped
> because it is full
> Sun Apr 10 18:34:53 2005|queue instance "all.q at compute-0-7.local" dropped
> because it is full
> Sun Apr 10 18:34:53 2005|queue instance "all.q at compute-0-11.local" dropped
> because it is full
> Sun Apr 10 18:34:53 2005|queue instance "all.q at compute-0-1.local" dropped
> because it is full
> Sun Apr 10 18:34:53 2005|queue instance "all.q at topaasi.local" dropped because
> it is full
> Sun Apr 10 18:34:53 2005|queues dropped because they are full:
> all.q at compute-0-6.local all.q at compute-0-10.local all.q at compute-0-8.local
> all.q at compute-0-3.local all.q at compute-0-9.local all.q at compute-0-2.local
> all.q at compute-0-0.local all.q at compute-0-4.local all.q at compute-0-5.local
> all.q at compute-0-7.local
> Sun Apr 10 18:34:53 2005|queues dropped because they are full:
> all.q at compute-0-11.local all.q at compute-0-1.local all.q at topaasi.local
> Sun Apr 10 18:34:53 2005|--------------STOP-SCHEDULER-RUN-------------
> 
> BTW, the queues have changed a little since my last mail, by the situation
> at the moment is, there are three jobs running and job 182 requests 24
> (=all) CPU's. The jobs have still many hours until their h_rt runs out.
> 
> In fact, I have never seen SGE6 backfill anything yet... I have only seen
> the highest priority job being dispatched. What worries me here is, that
> in the snipped from /sgeroot/cell/common/scheduler I sent in the first
> mail, reservations are only done for the first job and the rest of the
> jobs are apparently not even considered!
> 
> -- 
> 		 -----------------------------------------------
> 		| Juha Jäykkä, juolja at utu.fi			|
> 		| home: http://www.utu.fi/~juolja/		|
> 		 -----------------------------------------------
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
> For additional commands, e-mail: users-help at gridengine.sunsource.net
> 
> 



---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
For additional commands, e-mail: users-help at gridengine.sunsource.net




More information about the gridengine-users mailing list