[GE users] job not killed when h_cpu limit reached

Rene Salmon rsalmon at tulane.edu
Fri Aug 20 19:39:33 BST 2004


Hi,


I have some queues setup which run parallel jobs and single cpu jobs.
I have set these cpu time limits on all the queues.

s_cpu                48:00:00
h_cpu                50:00:00


If I understand correctly any job that reaches a cpu running time of
50 hours or more should get killed correct?

For some reason I have an MPI job that is still in the queues after
about 60 cpu hours.  The job is not really running all the CPUs in that
queue are idle but the job is listed as running in qstat and the job does
not exit.

compute-0-1.q        P     2/2       1.00     lx24-amd64
    285     0 dyn13      xxxxxx       r     08/17/2004 11:22:12 MASTER
            0 dyn13      xxxxxx       r     08/17/2004 11:22:12 SLAVE
            0 dyn13      xxxxxx       r     08/17/2004 11:22:12 SLAVE
----------------------------------------------------------------------------
compute-0-2.q        P     2/2       0.00     lx24-amd64
    285     0 dyn13      xxxxxx       r     08/17/2004 11:22:12 SLAVE
            0 dyn13      xxxxxx       r     08/17/2004 11:22:12 SLAVE
----------------------------------------------------------------------------
compute-0-3.q        P     2/2       0.00     lx24-amd64
    285     0 dyn13      xxxxxx       r     08/17/2004 11:22:12 SLAVE
            0 dyn13      xxxxxx       r     08/17/2004 11:22:12 SLAVE


I am missing something?  Does anyone know if I need to set some other
variable to make this job die after 50 cpu hours?

Thank you
Rene




---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
For additional commands, e-mail: users-help at gridengine.sunsource.net




More information about the gridengine-users mailing list