[GE users] SGE 6.0u1: run time type error

Bogdan Lobodzinski bogdan.lobodzinski at desy.de
Mon Sep 20 10:40:00 BST 2004


Hello,

    my SGE 6.0 (u1) server hangs when I try to remove a host from the
execution host list.
The same may also happen with the other host lists (administration,
groups, submission). I have not checked this yet.

The symptoms are as follows:
After executing the command:
% ./qconf -de <exec_host>
(or the qmon equivalent)

my sge_qmaster hangs, with these entries in the messages file:
----
...
09/18/2004 22:50:03|qmaster|erato|C|error: lGetElemHost(HR_name): run time type error
...
----
and
ps -auxw | grep x24-x86
shows:
--------------
root     28128  0.0  0.4 24116 4312 ?        S    22:18   0:00 /usr/N1_GE6/bin/lx24-x86/sge_qmaster
root     28131  0.0  0.0     0    0 ?        Z    22:18   0:00 [sge_qmaster <defunct>]
root     28132  0.0  0.4 24116 4312 ?        S    22:18   0:00 /usr/N1_GE6/bin/lx24-x86/sge_qmaster
root     28133  0.0  0.4 24116 4312 ?        S    22:18   0:00 /usr/N1_GE6/bin/lx24-x86/sge_qmaster
root     28134  0.0  0.4 24116 4312 ?        S    22:18   0:00 /usr/N1_GE6/bin/lx24-x86/sge_qmaster
root     28135  0.0  0.4 24116 4312 ?        S    22:18   0:00 /usr/N1_GE6/bin/lx24-x86/sge_qmaster
root     28136  0.0  0.4 24116 4312 ?        S    22:18   0:00 /usr/N1_GE6/bin/lx24-x86/sge_qmaster
root     28139  0.0  0.4 24116 4312 ?        S    22:18   0:00 /usr/N1_GE6/bin/lx24-x86/sge_qmaster
root     28142  0.0  0.4 24116 4312 ?        S    22:18   0:00 /usr/N1_GE6/bin/lx24-x86/sge_qmaster
root     28145  0.1  0.3  4704 3676 ?        S    22:18   0:03 /usr/N1_GE6/bin/lx24-x86/sge_schedd
root     28152  0.0  0.1  2268 1212 ?        S    22:18   0:00 /usr/N1_GE6/bin/lx24-x86/sge_shadowd
-------
Note the zombie process!
SGE cannot be used at all in this state.
After "kill -9 ..." and a server restart, the host is removed and
everything seems to work correctly.
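
To pick out the defunct entry from such a listing without reading it by
eye, something like the following sketch could be used. The function name
list_zombies is made up for illustration, and it assumes the usual
"ps aux" column layout, with the process state (STAT) in the eighth column
and the PID in the second:

```shell
# Read "ps aux"-style lines on stdin and print the PID of every entry
# whose STAT column (assumed to be field 8) starts with Z, i.e. a zombie.
list_zombies() {
    awk '$8 ~ /^Z/ { print $2 }'
}
```

It would be called as "ps auxw | list_zombies". A zombie cannot itself be
killed; it disappears once its parent process reaps it or exits.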

I use classic spooling and SuSE 8.2.

Any help on how to fix this error is welcome!


                  Thanks a lot,


			Bogdan




