[GE users] SGE 6.0u1: run time type error

Bogdan Lobodzinski bogdan.lobodzinski at desy.de
Mon Sep 20 10:54:54 BST 2004


Thank you for the answer.
Sorry, I did not find the problem in my searches.

            Cheers,

               Bogdan

On Mon, 20 Sep 2004, Ron Chen wrote:

> This problem was reported earlier this month, and the
> fix is on the way.
>
> http://gridengine.sunsource.net/issues/show_bug.cgi?id=1269
>
>  -Ron
>
>
> --- Bogdan Lobodzinski <bogdan.lobodzinski at desy.de>
> wrote:
>
> >
> > Hello,
> >
> >     my sge 6.0 (u1) server hangs when I try to
> > remove some host from
> > execution host list.
> > Maybe it appears also in case of another machine
> > list (administration,
> > groups,submission). I did not check this yet.
> >
> > The symptoms are following:
> > After execution of the command:
> > % ./qconf -de <exec_host>
> > (or qmon analog)
> >
> > my sge_qmaster hangs with entries in messages file:
> > ----
> > ...
> > 09/18/2004 22:50:03|qmaster|erato|C|error:
> > lGetElemHost(HR_name): run time
> > type error
> > ...
> > ----
> > and
> > ps -auxw | grep x24-x86
> > shows:
> > --------------
> > root     28128  0.0  0.4 24116 4312 ?        S
> > 22:18   0:00
> > /usr/N1_GE6/bin/lx24-x86/sge_qmaster
> > root     28131  0.0  0.0     0    0 ?        Z
> > 22:18   0:00
> > [sge_qmaster <defunct>]
> > root     28132  0.0  0.4 24116 4312 ?        S
> > 22:18   0:00
> > /usr/N1_GE6/bin/lx24-x86/sge_qmaster
> > root     28133  0.0  0.4 24116 4312 ?        S
> > 22:18   0:00
> > /usr/N1_GE6/bin/lx24-x86/sge_qmaster
> > root     28134  0.0  0.4 24116 4312 ?        S
> > 22:18   0:00
> > /usr/N1_GE6/bin/lx24-x86/sge_qmaster
> > root     28135  0.0  0.4 24116 4312 ?        S
> > 22:18   0:00
> > /usr/N1_GE6/bin/lx24-x86/sge_qmaster
> > root     28136  0.0  0.4 24116 4312 ?        S
> > 22:18   0:00
> > /usr/N1_GE6/bin/lx24-x86/sge_qmaster
> > root     28139  0.0  0.4 24116 4312 ?        S
> > 22:18   0:00
> > /usr/N1_GE6/bin/lx24-x86/sge_qmaster
> > root     28142  0.0  0.4 24116 4312 ?        S
> > 22:18   0:00
> > /usr/N1_GE6/bin/lx24-x86/sge_qmaster
> > root     28145  0.1  0.3  4704 3676 ?        S
> > 22:18   0:03
> > /usr/N1_GE6/bin/lx24-x86/sge_schedd
> > root     28152  0.0  0.1  2268 1212 ?        S
> > 22:18   0:00
> > /usr/N1_GE6/bin/lx24-x86/sge_shadowd
> > -------
> > see zombi process !
> > Any use of sge is not possible.
> > After "kill -9 ..." and server restart host is
> > removed and work seems to
> > be correct.
> >
> > I use classical spool and SuSE 8.2.
> >
> > Any help how to fix the error is welcome !
> >
> >
> >                   Thanks a lot,
> >
> >
> > 			Bogdan
> >
> >
> ---------------------------------------------------------------------
> > To unsubscribe, e-mail:
> > users-unsubscribe at gridengine.sunsource.net
> > For additional commands, e-mail:
> > users-help at gridengine.sunsource.net
> >
> >
>
>
>
>
> _______________________________
> Do you Yahoo!?
> Declare Yourself - Register online to vote today!
> http://vote.yahoo.com
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
> For additional commands, e-mail: users-help at gridengine.sunsource.net
>



---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
For additional commands, e-mail: users-help at gridengine.sunsource.net




More information about the gridengine-users mailing list