[GE users] SGE6 does not backfill

Juha Jäykkä juhaj at iki.fi
Sun Apr 10 16:41:56 BST 2005


    [ The following text is in the "ISO-8859-15" character set. ]
    [ Your display is set for the "ISO-8859-10" character set.  ]
    [ Some special characters may be displayed incorrectly. ]

> what is "qconf -tsm" giving in the
> /usr/sge/default/common/schedd_runlog?

Here it is. This looks quite ok to me, but of course I do not know how it
should happen. All the queues should be dropped, that is for sure, since
the first job in the queue reserves them. There are jobs, however, that
should be eligible for backfilling.


Sun Apr 10 18:34:53 2005|-------------START-SCHEDULER-RUN-------------
Sun Apr 10 18:34:53 2005|queue instance "all.q at topaasi.local" dropped because it is full
Sun Apr 10 18:34:53 2005|queues dropped because they are full: all.q at topaasi.local
Sun Apr 10 18:34:53 2005|Job 182 cannot run because available slots combined under PE "lam" are not in range of job
Sun Apr 10 18:34:53 2005|queue instance "all.q at compute-0-6.local" dropped because it is full
Sun Apr 10 18:34:53 2005|queue instance "all.q at compute-0-10.local" dropped because it is full
Sun Apr 10 18:34:53 2005|queue instance "all.q at compute-0-8.local" dropped because it is full
Sun Apr 10 18:34:53 2005|queue instance "all.q at compute-0-3.local" dropped because it is full
Sun Apr 10 18:34:53 2005|queue instance "all.q at compute-0-9.local" dropped because it is full
Sun Apr 10 18:34:53 2005|queue instance "all.q at compute-0-2.local" dropped because it is full
Sun Apr 10 18:34:53 2005|queue instance "all.q at compute-0-0.local" dropped because it is full
Sun Apr 10 18:34:53 2005|queue instance "all.q at compute-0-4.local" dropped because it is full
Sun Apr 10 18:34:53 2005|queue instance "all.q at compute-0-5.local" dropped because it is full
Sun Apr 10 18:34:53 2005|queue instance "all.q at compute-0-7.local" dropped because it is full
Sun Apr 10 18:34:53 2005|queue instance "all.q at compute-0-11.local" dropped because it is full
Sun Apr 10 18:34:53 2005|queue instance "all.q at compute-0-1.local" dropped because it is full
Sun Apr 10 18:34:53 2005|queue instance "all.q at topaasi.local" dropped because it is full
Sun Apr 10 18:34:53 2005|queues dropped because they are full: all.q at compute-0-6.local all.q at compute-0-10.local all.q at compute-0-8.local all.q at compute-0-3.local all.q at compute-0-9.local all.q at compute-0-2.local all.q at compute-0-0.local all.q at compute-0-4.local all.q at compute-0-5.local all.q at compute-0-7.local
Sun Apr 10 18:34:53 2005|queues dropped because they are full: all.q at compute-0-11.local all.q at compute-0-1.local all.q at topaasi.local
Sun Apr 10 18:34:53 2005|--------------STOP-SCHEDULER-RUN-------------

BTW, the queues have changed a little since my last mail, by the situation
at the moment is, there are three jobs running and job 182 requests 24
(=all) CPU's. The jobs have still many hours until their h_rt runs out.

In fact, I have never seen SGE6 backfill anything yet... I have only seen
the highest priority job being dispatched. What worries me here is, that
in the snipped from /sgeroot/cell/common/scheduler I sent in the first
mail, reservations are only done for the first job and the rest of the
jobs are apparently not even considered!

-- 
		 -----------------------------------------------
		| Juha Jäykkä, juolja at utu.fi			|
		| home: http://www.utu.fi/~juolja/		|
		 -----------------------------------------------

---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
For additional commands, e-mail: users-help at gridengine.sunsource.net




More information about the gridengine-users mailing list