[GE users] sge_qmaster service quits very often

manju a manju.kudu at gmail.com
Tue Apr 15 15:26:47 BST 2008


Hi all,

Is anybody familiar with the following error?

qhost

>> error: commlib error: can't connect to service (Connection refused)
>> error: unable to contact qmaster using port 534 on host "masterserver.abc.com"
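
"Connection refused" normally means nothing is listening on that port, so the
check can be repeated without qhost. A minimal sketch in Python, assuming the
host and port from the error above; the function name can_connect is only
illustrative and is not part of SGE:

```python
import socket


def can_connect(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port can be opened."""
    try:
        # create_connection resolves the host name and applies the timeout.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts and name-resolution failures.
        return False


# Example call, using the values from the qhost error:
#   can_connect("masterserver.abc.com", 534)
```

A False result here only confirms that the port is not accepting connections;
it does not say why qmaster stopped.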

I checked, and the qmaster service was not running. I tried restarting the
service, but it gave an error like "unable to unpack gid, unable to read
the qmaster configuration"...

In $SGE_ROOT/spool/message I can see the following messages:


04/15/2008 04:41:16|qmaster|grimmcs1vl|I|starting up SGE 6.1u2 (lx24-x86)
04/15/2008 15:57:57|qmaster|grimmcs1vl|E|acknowledge timeout after 600 seconds for event client (schedd:1) on host "masterserver.abc.com"
04/15/2008 16:01:49|qmaster|grimmcs1vl|E|commlib error: got read error (closing "masterserver.abc.com/qhost/2")
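
To pick out only the error entries from a longer messages file, something like
the sketch below may help. It assumes every entry has the pipe-separated layout
seen above (timestamp|daemon|host|level|text) with a one-letter level, and that
a line not matching this layout is a wrapped continuation of the previous
entry. The function names are illustrative only.

```python
def parse_messages(lines):
    """Split message lines into [timestamp, daemon, host, level, text] records."""
    records = []
    for raw in lines:
        line = raw.rstrip("\n")
        parts = line.split("|", 4)
        if len(parts) == 5 and len(parts[3]) == 1:
            # A new entry: four header fields followed by the message text.
            records.append(parts)
        elif records and line.strip():
            # Assumed to be a wrapped continuation of the previous entry.
            records[-1][4] += " " + line.strip()
    return records


def errors_only(records):
    """Keep only the records whose level field is "E"."""
    return [r for r in records if r[3] == "E"]
```

For the three entries above, this would keep the "acknowledge timeout" and
"commlib error" entries and drop the startup line.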

This all started suddenly. Is anybody aware of this problem?

Thanks,
manjunath A


