[GE users] NEEDING HELP IN ADMINISTRATING SGE6.1

Marco Donauer Marco.Donauer at Sun.COM
Tue Jul 24 10:51:46 BST 2007


    [ The following text is in the "iso-8859-1" character set. ]
    [ Your display is set for the "ISO-8859-10" character set.  ]
    [ Some special characters may be displayed incorrectly. ]

Alaya,

it ooks like that these execd's are not running:

queue.q at hilbert.pasteur.rns.tn <mailto:queue.q at hilbert.pasteur.rns.tn> 
BIP   0/4       -NA-     -NA-          au
----------------------------------------------------------------------------
queue.q at lucy.pasteur.rns.tn <mailto:queue.q at lucy.pasteur.rns.tn>    
BIP   0/4       -NA-     lx24-x86      au
----------------------------------------------------------------------------
queue_test at hilbert.pasteur.rns <mailto:queue_test at hilbert.pasteur.rns> 
BIP   0/1       -NA-     -NA-          au
----------------------------------------------------------------------------
queue_test at loir.pasteur.rns.tn <mailto:queue_test at loir.pasteur.rns.tn> 
BIP   0/1       -NA-     -NA-          au
----------------------------------------------------------------------------
queue_test at lucy.pasteur.rns.tn <mailto:queue_test at lucy.pasteur.rns.tn> 
BIP   0/1       -NA-     lx24-x86      au

They are in alarm and unknown state. The execds were not able to connect 
to qmaster, or they didn't start.
Please have look into the messages file, or try to restart the execution 
daemons on these hosts!

Marco

Nourhéne Alaya wrote:
> Thank you for answering my request.
> the out put of qhost:
> HOSTNAME                ARCH         NCPU  LOAD  MEMTOT  MEMUSE  
> SWAPTO  SWAPUS
> -------------------------------------------------------------------------------
> global                  -               -     -       -       -       
> -       -
> hilbert                 -               -     -       -       -       
> -       -
> ibnjazzar               lx24-x86        1  0.92  496.3M  289.1M  
> 957.0M  139.0M
> loir                    -               -     -       -       -       
> -       -
> lucy                    lx24-x86        1     -  495.3M       -    
> 1.4G       -
> nicolle                 lx24-x86        1  0.10 1003.9M  235.6M    
> 2.0G  104.6M
> parallele01             -               -     -       -       -       
> -       -
> parallele02             lx24-x86        1  0.00  494.9M  270.9M    
> 2.0G   20.0M
> parallele03             lx24-x86        1  0.00  494.9M  279.4M 
> 1019.7M   19.9M
> parallele04             lx24-x86        1  0.00  494.9M  265.4M 
> 1019.7M   20.0M
> parallele05             lx24-x86        1  0.01  494.9M  278.3M    
> 7.8M    7.8M
> parallele06             lx24-x86        1  0.00  494.9M  268.7M    
> 1.0G   38.2M
> parallele07             lx24-x86        1  0.06  494.9M  269.7M    
> 1.0G   38.2M
> parallele08             lx24-x86        1  0.00  494.9M  272.8M    
> 1.0G   38.2M
> parallele09             -               -     -       -       -       
> -       -
> parallele10             lx24-x86        1  0.00  494.9M  267.3M    
> 1.0G    1.7M
>
> The output of  qstat -f  is :
>  sge at hilbert:/opt/sge/examples/jobs$ qstat -f
> queuename                      qtype used/tot. load_avg arch          
> states
> ----------------------------------------------------------------------------
> queue.q at hilbert.pasteur.rns.tn <mailto:queue.q at hilbert.pasteur.rns.tn> 
> BIP   0/4       -NA-     -NA-          au
> ----------------------------------------------------------------------------
> queue.q at lucy.pasteur.rns.tn <mailto:queue.q at lucy.pasteur.rns.tn>    
> BIP   0/4       -NA-     lx24-x86      au
> ----------------------------------------------------------------------------
> queue.q at parallele02.pasteur.rn <mailto:queue.q at parallele02.pasteur.rn> 
> BIP   0/4       0.01     lx24-x86
> ----------------------------------------------------------------------------
> queue_test at hilbert.pasteur.rns <mailto:queue_test at hilbert.pasteur.rns> 
> BIP   0/1       -NA-     -NA-          au
> ----------------------------------------------------------------------------
> queue_test at loir.pasteur.rns.tn <mailto:queue_test at loir.pasteur.rns.tn> 
> BIP   0/1       -NA-     -NA-          au
> ----------------------------------------------------------------------------
> queue_test at lucy.pasteur.rns.tn <mailto:queue_test at lucy.pasteur.rns.tn> 
> BIP   0/1       -NA-     lx24-x86      au
> ----------------------------------------------------------------------------
> queue_test at nicolle.pasteur.rns <mailto:queue_test at nicolle.pasteur.rns> 
> BIP   0/1       0.10     lx24-x86
> ----------------------------------------------------------------------------
> queue_test at parallele02.pasteur <mailto:queue_test at parallele02.pasteur> 
> BIP   0/1       0.01     lx24-x86
> ----------------------------------------------------------------------------
> queue_test at parallele03.pasteur <mailto:queue_test at parallele03.pasteur> 
> BIP   0/1       0.00     lx24-x86
> ----------------------------------------------------------------------------
> queue_test at parallele04.pasteur <mailto:queue_test at parallele04.pasteur> 
> BIP   0/1       0.00     lx24-x86
> ----------------------------------------------------------------------------
> queue_test at parallele05.pasteur <mailto:queue_test at parallele05.pasteur> 
> BIP   0/1       0.00     lx24-x86
>
> ############################################################################
>  - PENDING JOBS - PENDING JOBS - PENDING JOBS - PENDING JOBS - PENDING 
> JOBS
> ############################################################################
>      65 0.00000 Sleeper    sge          qw    07/23/2007 15:35:40     1
>      66 0.00000 simple.sh  sge          qw    07/23/2007 16:06:56     1
>      67 0.00000 simple.sh  sge          qw    07/23/2007 16:24:59     1
>      70 0.00000 Sleeper    sge          qw    07/23/2007 16:46:50     1
>      71 0.00000 simple.sh  sge          qw    07/23/2007 16:52:54     1
>      72 0.00000 simple.sh  sge          qw    07/24/2007 09:41:38     1
>      73 0.00000 simple.sh  sge          qw    07/24/2007 10:09:14     
> 1 1-5:1
>      74 0.00000 simple.sh  sge          qw    07/24/2007 10:16:08     1
>
>
>
>
>
>
> -- 
> Si on veut on peut il suffit de faire le premier pas 

-- 

Sun Microsystems GmbH         Marco Donauer
Dr.-Leo-Ritter-Str. 7         N1 Grid Engine Engineering
D-93049 Regensburg            Phone: +49 (0)941 3075-211  (x60211)
Germany                       Fax: +49 (0)941 3075-222  (x60222)
http://www.sun.com/gridware
mailto:marco.donauer at sun.com
Sitz der Gesellschaft: Sun Microsystems GmbH, Sonnenallee 1, 
D-85551 Kirchheim-Heimstetten
Amtsgericht Muenchen: HRB 161028
Geschaeftsfuehrer: Wolfgang Engels, Dr. Roland Boemer
Vorsitzender des Aufsichtsrates: Martin Haering 

---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
For additional commands, e-mail: users-help at gridengine.sunsource.net




More information about the gridengine-users mailing list