[GE users] sge_qmaster service quits very often

manju a manju.kudu at gmail.com
Wed Apr 16 18:47:14 BST 2008


Sandeep,

The qmaster service keeps crashing. That is the issue.

Thanks,
Manjunath

On Wed, Apr 16, 2008 at 5:13 PM, Sandeep, Patel(IE10) <
Sandeep.Patel2 at honeywell.com> wrote:

>  Hi
>
>
>
>    Is your qmaster running or not?
>
>
>
>
>
> Regards
>
> Sandeep
>
>
>  ------------------------------
>
> *From:* manju a [mailto:manju.kudu at gmail.com]
> *Sent:* Tuesday, April 15, 2008 7:57 PM
> *To:* users at gridengine.sunsource.net
> *Subject:* [GE users] sge_qmaster service quits very often
>
>
>
> Hi all,
>
> Is anybody aware of the following error?
>
> qhost
>
> >> error: commlib error: can't connect to service (Connection refused)
> >> error: unable to contact qmaster using port 534 on host
> >> "masterserver.abc.com"
>
>
> I checked the service and the qmaster service was not running. I tried
> restarting it, but it gave errors such as "unable to unpack gid,
> unable to read the q master configuration"...
>
> In $SGE_ROOT/spool/message I can see the messages below:
>
>
> 04/15/2008 04:41:16|qmaster|grimmcs1vl|I|starting up SGE 6.1u2 (lx24-x86)
> 04/15/2008 15:57:57|qmaster|grimmcs1vl|E|acknowledge timeout after 600
> seconds for event client (schedd:1) on host "masterserver.abc.com"
> 04/15/2008 16:01:49|qmaster|grimmcs1vl|E|commlib error: got read error
> (closing "masterserver.abc.com/qhost/2")
>
> It all happens suddenly. Is anybody aware of this problem?
>
> Thanks,
> Manjunath A
>
>
>
>
>
>
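
The log lines quoted in the original post appear to follow a
pipe-delimited layout: timestamp|daemon|host|level|message. As a rough
sketch (not an official Grid Engine tool), the Python below splits such
lines into fields and keeps only the "E" entries, which makes it easier
to see what the qmaster logged before it went away. The names LogEntry,
parse_messages and errors_only are hypothetical, and reading the level
field as a single letter (I for info, E for error) is an assumption
based only on the lines shown in this thread. Lines that do not match
the layout are treated as continuations of the previous entry, since
mail clients tend to wrap long log lines.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class LogEntry:
    """One entry from a qmaster messages file (field names are hypothetical)."""
    timestamp: str
    daemon: str
    host: str
    level: str
    message: str


def parse_messages(text: str) -> List[LogEntry]:
    """Parse 'timestamp|daemon|host|level|message' lines.

    Assumption: the level field is a single letter. Any line that does
    not fit the layout is appended to the previous entry's message.
    """
    entries: List[LogEntry] = []
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        parts = line.split("|", 4)
        if len(parts) == 5 and len(parts[3]) == 1 and parts[3].isalpha():
            entries.append(LogEntry(*parts))
        elif entries:
            # Wrapped continuation of the previous log entry.
            entries[-1].message += " " + line
    return entries


def errors_only(entries: List[LogEntry]) -> List[LogEntry]:
    """Keep entries whose level is 'E' (assumed here to mean error)."""
    return [e for e in entries if e.level == "E"]


# The three entries quoted in this thread, wrapped as they were in the mail.
SAMPLE = """04/15/2008 04:41:16|qmaster|grimmcs1vl|I|starting up SGE 6.1u2 (lx24-x86)
04/15/2008 15:57:57|qmaster|grimmcs1vl|E|acknowledge timeout after 600
seconds for event client (schedd:1) on host "masterserver.abc.com"
04/15/2008 16:01:49|qmaster|grimmcs1vl|E|commlib error: got read error
(closing "masterserver.abc.com/qhost/2")
"""

if __name__ == "__main__":
    for entry in errors_only(parse_messages(SAMPLE)):
        print(entry.timestamp, entry.message)
```

To run it against a real file, read the file's contents into a string
and pass that to parse_messages in place of SAMPLE.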


