[GE issues] [Issue 2842] listener threads get stuck in cl_commlib_receive_message

tholzer tholzer at wetafx.co.nz
Fri Dec 19 02:44:41 GMT 2008


http://gridengine.sunsource.net/issues/show_bug.cgi?id=2842






------- Additional comments from tholzer at sunsource.net Thu Dec 18 18:44:34 -0800 2008 -------
After I see the following in the log, the qmaster stops responding to GDI
requests and memory usage climbs steadily until the limit (48GB) has been
reached (usually within 5-10 minutes).

12/19/2008 15:35:55|worker|yori|E|unable to find job 56 from the scheduler order
package
12/19/2008 15:35:55|worker|yori|W|Skipping remaining 219 orders
12/19/2008 15:35:55|schedu|yori|E|unable to find job 56 from the scheduler order
package
12/19/2008 15:35:56|worker|yori|E|scheduler tries to schedule job 1416.1 twice
12/19/2008 15:35:56|worker|yori|W|Skipping remaining 219 orders
12/19/2008 15:35:56|schedu|yori|E|scheduler tries to schedule job 1416.1 twice
12/19/2008 15:37:03|worker|yori|E|unable to find job 22 from the scheduler order
package
12/19/2008 15:37:03|worker|yori|W|Skipping remaining 230 orders
12/19/2008 15:37:03|schedu|yori|E|unable to find job 22 from the scheduler order
package
12/19/2008 15:37:05|schedu|yori|E|could not find job "56" in master list
12/19/2008 15:37:05|schedu|yori|E|callback function for event "29632. EVENT DEL
JOB 56.1" failed
12/19/2008 15:38:12|schedu|yori|E|could not find job "22" in master list
12/19/2008 15:38:12|schedu|yori|E|callback function for event "34758. EVENT DEL
JOB 22.1" failed

I'm still putting together the dependency tree for the jobs.

------------------------------------------------------
http://gridengine.sunsource.net/ds/viewMessage.do?dsForumId=36&dsMessageId=93298

To unsubscribe from this discussion, e-mail: [issues-unsubscribe at gridengine.sunsource.net].



More information about the gridengine-users mailing list