[GE users] SGE 6.0u1: run time type error

Ron Chen ron_chen_123 at yahoo.com
Mon Sep 20 10:43:41 BST 2004


This problem was reported earlier this month, and the
fix is on the way.

http://gridengine.sunsource.net/issues/show_bug.cgi?id=1269

 -Ron


--- Bogdan Lobodzinski <bogdan.lobodzinski at desy.de>
wrote:

> 
> Hello,
> 
>     my sge 6.0 (u1) server hangs when I try to
> remove some host from
> execution host list.
> Maybe it appears also in case of another machine
> list (administration,
> groups,submission). I did not check this yet.
> 
> The symptoms are following:
> After execution of the command:
> % ./qconf -de <exec_host>
> (or qmon analog)
> 
> my sge_qmaster hangs with entries in messages file:
> ----
> ...
> 09/18/2004 22:50:03|qmaster|erato|C|error:
> lGetElemHost(HR_name): run time
> type error
> ...
> ----
> and
> ps -auxw | grep x24-x86
> shows:
> --------------
> root     28128  0.0  0.4 24116 4312 ?        S   
> 22:18   0:00
> /usr/N1_GE6/bin/lx24-x86/sge_qmaster
> root     28131  0.0  0.0     0    0 ?        Z   
> 22:18   0:00
> [sge_qmaster <defunct>]
> root     28132  0.0  0.4 24116 4312 ?        S   
> 22:18   0:00
> /usr/N1_GE6/bin/lx24-x86/sge_qmaster
> root     28133  0.0  0.4 24116 4312 ?        S   
> 22:18   0:00
> /usr/N1_GE6/bin/lx24-x86/sge_qmaster
> root     28134  0.0  0.4 24116 4312 ?        S   
> 22:18   0:00
> /usr/N1_GE6/bin/lx24-x86/sge_qmaster
> root     28135  0.0  0.4 24116 4312 ?        S   
> 22:18   0:00
> /usr/N1_GE6/bin/lx24-x86/sge_qmaster
> root     28136  0.0  0.4 24116 4312 ?        S   
> 22:18   0:00
> /usr/N1_GE6/bin/lx24-x86/sge_qmaster
> root     28139  0.0  0.4 24116 4312 ?        S   
> 22:18   0:00
> /usr/N1_GE6/bin/lx24-x86/sge_qmaster
> root     28142  0.0  0.4 24116 4312 ?        S   
> 22:18   0:00
> /usr/N1_GE6/bin/lx24-x86/sge_qmaster
> root     28145  0.1  0.3  4704 3676 ?        S   
> 22:18   0:03
> /usr/N1_GE6/bin/lx24-x86/sge_schedd
> root     28152  0.0  0.1  2268 1212 ?        S   
> 22:18   0:00
> /usr/N1_GE6/bin/lx24-x86/sge_shadowd
> -------
> see zombi process !
> Any use of sge is not possible.
> After "kill -9 ..." and server restart host is
> removed and work seems to
> be correct.
> 
> I use classical spool and SuSE 8.2.
> 
> Any help how to fix the error is welcome !
> 
> 
>                   Thanks a lot,
> 
> 
> 			Bogdan
> 
>
---------------------------------------------------------------------
> To unsubscribe, e-mail:
> users-unsubscribe at gridengine.sunsource.net
> For additional commands, e-mail:
> users-help at gridengine.sunsource.net
> 
> 



		
_______________________________
Do you Yahoo!?
Declare Yourself - Register online to vote today!
http://vote.yahoo.com

---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
For additional commands, e-mail: users-help at gridengine.sunsource.net




More information about the gridengine-users mailing list