[GE users] Problem with commd communications

Craig Tierney ctierney at hpti.com
Mon Jun 14 22:04:01 BST 2004


On Mon, 2004-06-14 at 15:57, Rayson Ho wrote:
> >- When the system is no longer able to run jobs, the load on
> >  commd on qmaster is around 100% the whole time.  Generally
> >  the load on commd < 10%.
> 
> Can you attach a debugger and find out where it is looping??

I will try and do this the next time.  I need to rebuild
SGE so that the daemons have debugging information.  

> 
> Also, does restarting SGE daemons help??

Restarting SGE on the master does not help.  Sometimes migrating
the server to the shadow host helps, but not always.  

Craig


> 
> Rayson
> ---------------------------------------------------------
> Get your FREE E-mail account at http://www.eseenet.com !
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
> For additional commands, e-mail: users-help at gridengine.sunsource.net
> 


---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
For additional commands, e-mail: users-help at gridengine.sunsource.net




More information about the gridengine-users mailing list