[GE users] error starting execd

Jason Crane Jason.Crane at mrsc.ucsf.edu
Tue May 18 00:15:23 BST 2004


Hi,

I'm running 5.3p5 with a RH master and a combination of Solaris and RH 
exec nodes.  The qmaster and execd installations run just fine on the RH 
master node.  The execd install also works as expected on all Solaris 
hosts, but fails on all RH hosts:


Grid Engine execution daemon startup
------------------------------------

Starting execution daemon daemon. Please wait ...
   starting sge_execd
starting program: /netopt/sge/bin/glinux/sge_commd
using service "sge_commd"
bound to port 536
using service "sge_commd"
bound to port 536
error: getting configuration: unable to contact qmaster via "node1" 
commd using port 536 (service "sge_commd")
error: can't get configuration from qmaster -- backgrounding

- All RH /etc/hosts look like:
127.0.0.1      localhost.localdomain localhost
xx.xx.xx.xx    nodex    nodex.dom.org
yy.yy.yy.y     hostx    hostx.dom.org   loghost

- gethostbyname & gethostname seem to resolve correctly.

Any ideas would be much appreciated.

Thanks,
Jason



---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
For additional commands, e-mail: users-help at gridengine.sunsource.net




More information about the gridengine-users mailing list