[GE users] qmaster for 6.1U5 crashing

templedf dan.templeton at sun.com
Mon Feb 23 13:42:45 GMT 2009


Was it ever set to 0:0:0?  What's in the messages file?

Daniel
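
For anyone checking this on their own cluster: the interval is the
schedule_interval line of the scheduler configuration, as printed by
"qconf -ssconf". Below is a minimal sketch that converts such an h:m:s
value to seconds and flags a zero interval. The helper names and the sample
text are made up for illustration, and real output has more fields.

```python
def interval_seconds(value):
    """Convert an h:m:s string such as "0:0:15" to seconds."""
    hours, minutes, seconds = (int(part) for part in value.strip().split(":"))
    return hours * 3600 + minutes * 60 + seconds


def find_schedule_interval(ssconf_text):
    """Return the schedule_interval value from `qconf -ssconf`-style text, or None."""
    for line in ssconf_text.splitlines():
        fields = line.split()
        if len(fields) >= 2 and fields[0] == "schedule_interval":
            return fields[1]
    return None


# Hypothetical excerpt; real `qconf -ssconf` output contains more fields.
sample = "algorithm default\nschedule_interval 0:0:15\nmaxujobs 0\n"
value = find_schedule_interval(sample)
secs = interval_seconds(value)
note = " (zero interval!)" if secs == 0 else ""
print(f"{value} -> {secs} seconds{note}")
```

In practice the sample string would be replaced by the captured output of
"qconf -ssconf" on the qmaster host.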

magawake wrote:
> The scheduler interval is "0:0:15".
>
>
> During the crash there were many jobs on the system, probably around 1000 array jobs.
>
> The job counter is close to 900k, if that makes any difference.
>
> TIA
>   
>> On 21.02.2009 at 15:19, magawake wrote:
>>
>>     
>>> In the past week our qmaster was in an endless loop. The CPU was
>>> at 100% and there was no communication with the execd.
>>>       
>> What is the setting of the schedule interval and how many jobs are in  
>> the system?
>>
>> -- Reuti
>>
>>
>>     
>>> The fix was to simply stop and restart the daemon, but I am not
>>> sure what was causing this issue. Next time this occurs, is there
>>> something I can do to get more info and submit a bug report? Like
>>> logs, strace, debug info, etc.
>>>
>>> TIA
>>>
>>> ------------------------------------------------------
>>> http://gridengine.sunsource.net/ds/viewMessage.do?dsForumId=38&dsMessageId=111150
>>>
>>> To unsubscribe from this discussion, e-mail: [users-unsubscribe at gridengine.sunsource.net].
>>>       



