[GE users] I need help with GE configuration

Esteban Freire esfreire at cesga.es
Mon Sep 1 13:15:51 BST 2008


    [ The following text is in the "ISO-8859-1" character set. ]
    [ Your display is set for the "ISO-8859-10" character set.  ]
    [ Some special characters may be displayed incorrectly. ]

Hi Reuti,

First of all, thanks for answering me :)

Reuti wrote:
> Hi,
>
> Am 28.08.2008 um 12:52 schrieb Esteban Freire:
>
>> Hi all,
>>
>> I'm getting some problems with me GE configuration. I don't 
>> understand what I'm missing, so I would appreciate your help. By the 
>> way, I'm using GE 6.1u3.
>>
>> I send attach on this e-mail the configuration files which I consider 
>> more important. The problem is the  next:
>>
>> The WN which I have configured with GE, have 8 processors, and I have 
>> the almost all configured as *complex_values        
>> num_proc=8,s_vmem=8G*,  but three of them are configured as 
>> *complex_values        num_proc=9,s_vmem=9G*, in order only a queue 
>> can see these extra processors.
>>
>> Then, I have configured the almost all queues as:
>>
>> [ .... ]
>> slots                 
>> 8,[wn001.egee.cesga.es=8],[wn002.egee.cesga.es=4], \
>>                      [wn004.egee.cesga.es=8],[wn005.egee.cesga.es=8], \
>>                      [wn006.egee.cesga.es=8],[wn007.egee.cesga.es=8], \
>>                      [wn008.egee.cesga.es=8],[wn009.egee.cesga.es=8], \
>>                      [wn010.egee.cesga.es=8],[wn011.egee.cesga.es=8], \
>>                      [wn012.egee.cesga.es=8],[wn013.egee.cesga.es=8], \
>>                      [wn014.egee.cesga.es=8]
>>
>> [ .... ]
>> complex_values        num_proc=8,s_vmem=8
> I would suggest never to touch num_proc anywhere. It's a feature of a 
> machine which is fixed. Independent from the number of real seen cores 
> in a machine, you can define as many slots as you like and 
> oversubscribe the node this way.
>
> Also your RQS per user can be implemented by using slots instead of 
> num_proc.
>
> When slots is only 8 in the above queue specification, there can't run 
> 9 jobs in it (you should never see a outpt 9/8 for used 9 out of 8 in 
> qstat). What you might notice is an oversubscription of a node due to 
> combined usage from all queues? For this limit you would need an 
> additonal RQS limiting the number of slots per machine to 8 (only the 
> normal queues), and 9 (all queues including the special queue)
>
> limit name normal queues alice, atlas, biomed, cesga, (all additonal 
> queues here) hosts {*} to slots=8
>
>
> and a second RQS:
>
> limit name all queues alice, atlas, biomed, cesga, (all additonal 
> queues here), ops hosts {*} to slots=9
>
>
> -- Reuti

Ok.  I have put num_proc, because we have configured num_proc as 
consumable (<=    FORCED      YES        0        0) and we have put 
that the users ask num_proc and s_vmem in the qsub. The problem is that 
if I don't configure num_proc in the WN and I don't send jobs asking 
num_proc, then I cannot see with a qhost how many CPUS are being used in 
*NCPU*  variable, I can see the load but it don't count the CPUS busy.

On the other hand, I have tested the  RQS rules additional that you 
commented me, the problem is that, this is sequential, I mean,  it 
starts  looking  what nodes have num_proc free, but if the first node to 
check in the list is busy, it doesn't keep looking, and therefore, this 
is not useful for me because at the end I have free CPUS which I cannot 
used because it doesn't look these nodes.

Cheers,
Esteban
>
>
>> [ .... ]
>>
>> But then I have the special queue which I want that can see the extra 
>> slots (I only want this queue can do it):
>>
>> [ .... ]
>> slots                 
>> 9,[wn004.egee.cesga.es=9],[wn001.egee.cesga.es=9], \
>>                      [wn002.egee.cesga.es=8],[wn005.egee.cesga.es=8], \
>>                      [wn006.egee.cesga.es=9],[wn007.egee.cesga.es=8], \
>>                      [wn008.egee.cesga.es=8],[wn009.egee.cesga.es=8], \
>>                      [wn010.egee.cesga.es=8],[wn011.egee.cesga.es=8], \
>>                      [wn012.egee.cesga.es=8],[wn013.egee.cesga.es=8], \
>>                      [wn014.egee.cesga.es=8]
>>
>> [ .... ]
>> complex_values        num_proc=9,s_vmem=9G
>> [ .... ]
>>
>> So, as far I understand, only this special queue could fill 9 
>> processors, but the problem is that the other queues are filling the 
>> 9 processors, when I only want they fill 8 processors, so I don't 
>> understand what I missing, maybe SGE is giving more privileges to RQS 
>> acl that the node/queue configuration complex.
>>
>> Thanks in advance,
>> Esteban
>> [root at ce2 ~]# qconf -se wn001
>> hostname              wn001.egee.cesga.es
>> load_scaling          NONE
>> complex_values        num_proc=9,s_vmem=9G
>> load_values           arch=lx26-x86,num_proc=8,mem_total=3744.000000M, \
>>                       
>> swap_total=511.992188M,virtual_total=4255.992188M, \
>>                       load_avg=7.000000,load_short=7.000000, \
>>                       load_medium=7.000000,load_long=7.070000, \
>>                       mem_free=1749.078125M,swap_free=507.320312M, \
>>                       virtual_free=2256.398438M,mem_used=1994.921875M, \
>>                       swap_used=4.671875M,virtual_used=1999.593750M, \
>>                       cpu=100.000000,np_load_avg=0.875000, \
>>                       np_load_short=0.875000,np_load_medium=0.875000, \
>>                       np_load_long=0.883750
>> processors            8
>> user_lists            NONE
>> xuser_lists           NONE
>> projects              NONE
>> xprojects             NONE
>> usage_scaling         NONE
>> report_variables      NONE
>> [root at ce2 ~]# qconf -se wn007
>> hostname              wn007.egee.cesga.es
>> load_scaling          NONE
>> complex_values        num_proc=8,s_vmem=8G
>> load_values           arch=lx26-x86,num_proc=8,mem_total=3700.000000M, \
>>                       
>> swap_total=511.992188M,virtual_total=4211.992188M, \
>>                       load_avg=6.340000,load_short=6.070000, \
>>                       load_medium=6.340000,load_long=6.630000, \
>>                       mem_free=2374.429688M,swap_free=507.851562M, \
>>                       virtual_free=2882.281250M,mem_used=1325.570312M, \
>>                       swap_used=4.140625M,virtual_used=1329.710938M, \
>>                       cpu=75.000000,np_load_avg=0.792500, \
>>                       np_load_short=0.758750,np_load_medium=0.792500, \
>>                       np_load_long=0.828750
>> processors            8
>> user_lists            NONE
>> xuser_lists           NONE
>> projects              NONE
>> xprojects             NONE
>> usage_scaling         NONE
>> report_variables      NONE
>>
>> [root at ce2 ~]# qconf -sq ops
>> qname                 ops
>> hostlist              wn001.egee.cesga.es wn002.egee.cesga.es \
>>                       wn004.egee.cesga.es wn005.egee.cesga.es \
>>                       wn006.egee.cesga.es wn007.egee.cesga.es \
>>                       wn008.egee.cesga.es wn009.egee.cesga.es \
>>                       wn010.egee.cesga.es wn011.egee.cesga.es \
>>                       wn012.egee.cesga.es wn013.egee.cesga.es \
>>                       wn014.egee.cesga.es
>> seq_no                0
>> load_thresholds       np_load_avg=1.75
>> suspend_thresholds    NONE
>> nsuspend              1
>> suspend_interval      00:05:00
>> priority              19
>> min_cpu_interval      00:05:00
>> processors            UNDEFINED
>> qtype                 BATCH INTERACTIVE
>> ckpt_list             NONE
>> pe_list               make
>> rerun                 FALSE
>> slots                 
>> 9,[wn004.egee.cesga.es=9],[wn001.egee.cesga.es=9], \
>>                       [wn002.egee.cesga.es=4],[wn005.egee.cesga.es=8], \
>>                       [wn006.egee.cesga.es=9],[wn007.egee.cesga.es=8], \
>>                       [wn008.egee.cesga.es=8],[wn009.egee.cesga.es=8], \
>>                       [wn010.egee.cesga.es=8],[wn011.egee.cesga.es=8], \
>>                       [wn012.egee.cesga.es=8],[wn013.egee.cesga.es=8], \
>>                       [wn014.egee.cesga.es=8]
>> tmpdir                /tmp
>> shell                 /bin/sh
>> prolog                NONE
>> epilog                
>> /usr/local/sge/pro/default/prolog_epilog_scripts/epilog_grid
>> shell_start_mode      posix_compliant
>> starter_method        NONE
>> suspend_method        NONE
>> resume_method         NONE
>> terminate_method      NONE
>> notify                00:00:60
>> owner_list            NONE
>> user_lists            ops opsprd opssgm
>> xuser_lists           NONE
>> subordinate_list      NONE
>> complex_values        num_proc=9,s_vmem=9G
>> projects              NONE
>> xprojects             NONE
>> calendar              NONE
>> initial_state         default
>> s_rt                  150:00:00
>> h_rt                  150:00:00
>> s_cpu                 72:00:00
>> h_cpu                 72:00:00
>> s_fsize               INFINITY
>> h_fsize               INFINITY
>> s_data                INFINITY
>> h_data                INFINITY
>> s_stack               INFINITY
>> h_stack               INFINITY
>> s_core                INFINITY
>> h_core                INFINITY
>> s_rss                 INFINITY
>> h_rss                 INFINITY
>> s_vmem                INFINITY
>> h_vmem                INFINITY
>>
>> [root at ce2 ~]# qstat -f | grep cesga
>> alice at wn001.egee.cesga.es      BIP   0/8       8.93     lx26-x86
>> alice at wn002.egee.cesga.es      BIP   0/4       -NA-     lx26-x86      au
>> alice at wn004.egee.cesga.es      BIP   0/8       8.83     lx26-x86
>> alice at wn005.egee.cesga.es      BIP   0/8       -NA-     -NA-          au
>> alice at wn006.egee.cesga.es      BIP   0/8       8.83     lx26-x86
>> alice at wn007.egee.cesga.es      BIP   0/8       6.97     lx26-x86
>> alice at wn008.egee.cesga.es      BIP   0/8       5.99     lx26-x86
>> alice at wn009.egee.cesga.es      BIP   0/8       7.92     lx26-x86
>> alice at wn010.egee.cesga.es      BIP   0/8       5.99     lx26-x86
>> alice at wn011.egee.cesga.es      BIP   0/8       6.99     lx26-x86
>> alice at wn012.egee.cesga.es      BIP   0/8       6.98     lx26-x86
>> alice at wn013.egee.cesga.es      BIP   0/8       5.96     lx26-x86
>> alice at wn014.egee.cesga.es      BIP   0/8       -NA-     -NA-          au
>> atlas at wn001.egee.cesga.es      BIP   0/8       8.93     lx26-x86
>> atlas at wn002.egee.cesga.es      BIP   0/4       -NA-     lx26-x86      au
>> atlas at wn004.egee.cesga.es      BIP   0/8       8.83     lx26-x86
>> atlas at wn005.egee.cesga.es      BIP   0/8       -NA-     -NA-          au
>> atlas at wn006.egee.cesga.es      BIP   0/8       8.83     lx26-x86
>> atlas at wn007.egee.cesga.es      BIP   0/8       6.97     lx26-x86
>> atlas at wn008.egee.cesga.es      BIP   0/8       5.99     lx26-x86
>> atlas at wn009.egee.cesga.es      BIP   0/8       7.92     lx26-x86
>> atlas at wn010.egee.cesga.es      BIP   0/8       5.99     lx26-x86
>> atlas at wn011.egee.cesga.es      BIP   0/8       6.99     lx26-x86
>> atlas at wn012.egee.cesga.es      BIP   0/8       6.98     lx26-x86
>> atlas at wn013.egee.cesga.es      BIP   0/8       5.96     lx26-x86
>> atlas at wn014.egee.cesga.es      BIP   0/8       -NA-     -NA-          au
>> biomed at wn001.egee.cesga.es     BIP   0/8       8.93     lx26-x86
>> biomed at wn002.egee.cesga.es     BIP   0/4       -NA-     lx26-x86      au
>> biomed at wn004.egee.cesga.es     BIP   0/8       8.83     lx26-x86
>> biomed at wn005.egee.cesga.es     BIP   0/8       -NA-     -NA-          au
>> biomed at wn006.egee.cesga.es     BIP   0/8       8.83     lx26-x86
>> biomed at wn007.egee.cesga.es     BIP   1/8       6.97     lx26-x86
>> biomed at wn008.egee.cesga.es     BIP   1/8       5.99     lx26-x86
>> biomed at wn009.egee.cesga.es     BIP   0/8       7.92     lx26-x86
>> biomed at wn010.egee.cesga.es     BIP   2/8       5.99     lx26-x86
>> biomed at wn011.egee.cesga.es     BIP   0/8       6.99     lx26-x86
>> biomed at wn012.egee.cesga.es     BIP   1/8       6.98     lx26-x86
>> biomed at wn013.egee.cesga.es     BIP   1/8       5.96     lx26-x86
>> biomed at wn014.egee.cesga.es     BIP   0/8       -NA-     -NA-          au
>> cesga at wn001.egee.cesga.es      BIP   5/8       8.93     lx26-x86
>>   29784 0.05803 STDIN      cesga050     r     08/28/2008 11:43:47     1
>>   29799 0.05513 STDIN      cesga050     r     08/28/2008 11:50:47     1
>>   29801 0.05507 STDIN      cesga050     r     08/28/2008 11:50:47     1
>>   29805 0.05483 STDIN      cesga050     r     08/28/2008 11:54:47     1
>>   29806 0.05480 STDIN      cesga050     r     08/28/2008 11:58:02     1
>> cesga at wn002.egee.cesga.es      BIP   0/4       -NA-     lx26-x86      au
>> cesga at wn004.egee.cesga.es      BIP   6/8       8.83     lx26-x86
>>   29774 0.05821 STDIN      cesga050     r     08/28/2008 11:43:32     1
>>   29775 0.05816 STDIN      cesga050     r     08/28/2008 11:43:32     1
>>   29780 0.05808 STDIN      cesga050     r     08/28/2008 11:43:47     1
>>   29785 0.05802 STDIN      cesga050     r     08/28/2008 11:43:47     1
>>   29792 0.05784 STDIN      cesga050     r     08/28/2008 11:44:17     1
>>   29802 0.05504 STDIN      cesga050     r     08/28/2008 11:51:02     1
>> cesga at wn005.egee.cesga.es      BIP   0/8       -NA-     -NA-          au
>> cesga at wn006.egee.cesga.es      BIP   4/8       8.83     lx26-x86
>>   29791 0.05791 STDIN      cesga050     r     08/28/2008 11:44:02     1
>>   29797 0.05776 STDIN      cesga050     r     08/28/2008 11:44:32     1
>>   29800 0.05507 STDIN      cesga050     r     08/28/2008 11:50:47     1
>>   29803 0.05501 STDIN      cesga050     r     08/28/2008 11:51:02     1
>> cesga at wn007.egee.cesga.es      BIP   2/8       6.97     lx26-x86
>>   29788 0.05793 STDIN      cesga050     r     08/28/2008 11:44:02     1
>>   29794 0.05780 STDIN      cesga050     r     08/28/2008 11:44:32     1
>> cesga at wn008.egee.cesga.es      BIP   2/8       5.99     lx26-x86
>>   29777 0.05813 STDIN      cesga050     r     08/28/2008 11:43:32     1
>>   29782 0.05805 STDIN      cesga050     r     08/28/2008 11:43:47     1
>> cesga at wn009.egee.cesga.es      BIP   4/8       7.92     lx26-x86
>>   29778 0.05808 STDIN      cesga050     r     08/28/2008 11:43:47     1
>>   29783 0.05805 STDIN      cesga050     r     08/28/2008 11:43:47     1
>>   29786 0.05801 STDIN      cesga050     r     08/28/2008 11:44:02     1
>>   29798 0.05776 STDIN      cesga050     r     08/28/2008 11:44:32     1
>> cesga at wn010.egee.cesga.es      BIP   1/8       5.99     lx26-x86
>>   29789 0.05792 STDIN      cesga050     r     08/28/2008 11:44:02     1
>> cesga at wn011.egee.cesga.es      BIP   2/8       6.99     lx26-x86
>>   29790 0.05792 STDIN      cesga050     r     08/28/2008 11:44:02     1
>>   29796 0.05777 STDIN      cesga050     r     08/28/2008 11:44:32     1
>> cesga at wn012.egee.cesga.es      BIP   3/8       6.98     lx26-x86
>>   29787 0.05801 STDIN      cesga050     r     08/28/2008 11:44:02     1
>>   29795 0.05777 STDIN      cesga050     r     08/28/2008 11:44:32     1
>>   29804 0.05500 STDIN      cesga050     r     08/28/2008 11:51:17     1
>> cesga at wn013.egee.cesga.es      BIP   2/8       5.96     lx26-x86
>>   29776 0.05813 STDIN      cesga050     r     08/28/2008 11:43:32     1
>>   29781 0.05806 STDIN      cesga050     r     08/28/2008 11:43:47     1
>> cesga at wn014.egee.cesga.es      BIP   0/8       -NA-     -NA-          au
>> cms at wn001.egee.cesga.es        BIP   0/8       8.93     lx26-x86
>> cms at wn002.egee.cesga.es        BIP   0/4       -NA-     lx26-x86      au
>> cms at wn004.egee.cesga.es        BIP   0/8       8.83     lx26-x86
>> cms at wn005.egee.cesga.es        BIP   0/8       -NA-     -NA-          au
>> cms at wn006.egee.cesga.es        BIP   0/8       8.83     lx26-x86
>> cms at wn007.egee.cesga.es        BIP   0/8       6.97     lx26-x86
>> cms at wn008.egee.cesga.es        BIP   0/8       5.99     lx26-x86
>> cms at wn009.egee.cesga.es        BIP   0/8       7.92     lx26-x86
>> cms at wn010.egee.cesga.es        BIP   0/8       5.99     lx26-x86
>> cms at wn011.egee.cesga.es        BIP   0/8       6.99     lx26-x86
>> cms at wn012.egee.cesga.es        BIP   0/8       6.98     lx26-x86
>> cms at wn013.egee.cesga.es        BIP   0/8       5.96     lx26-x86
>> cms at wn014.egee.cesga.es        BIP   0/8       -NA-     -NA-          au
>> compchem at wn001.egee.cesga.es   BIP   4/8       8.93     lx26-x86
>> compchem at wn002.egee.cesga.es   BIP   0/4       -NA-     lx26-x86      au
>> compchem at wn004.egee.cesga.es   BIP   3/8       8.83     lx26-x86
>> compchem at wn005.egee.cesga.es   BIP   0/8       -NA-     -NA-          au
>> compchem at wn006.egee.cesga.es   BIP   5/8       8.83     lx26-x86
>> compchem at wn007.egee.cesga.es   BIP   4/8       6.97     lx26-x86
>> compchem at wn008.egee.cesga.es   BIP   4/8       5.99     lx26-x86
>> compchem at wn009.egee.cesga.es   BIP   4/8       7.92     lx26-x86
>> compchem at wn010.egee.cesga.es   BIP   4/8       5.99     lx26-x86
>> compchem at wn011.egee.cesga.es   BIP   5/8       6.99     lx26-x86
>> compchem at wn012.egee.cesga.es   BIP   3/8       6.98     lx26-x86
>> compchem at wn013.egee.cesga.es   BIP   3/8       5.96     lx26-x86
>> compchem at wn014.egee.cesga.es   BIP   0/8       -NA-     -NA-          au
>> diligent at wn001.egee.cesga.es   BIP   0/8       8.93     lx26-x86
>> diligent at wn002.egee.cesga.es   BIP   0/4       -NA-     lx26-x86      au
>> diligent at wn004.egee.cesga.es   BIP   0/8       8.83     lx26-x86
>> diligent at wn005.egee.cesga.es   BIP   0/8       -NA-     -NA-          au
>> diligent at wn006.egee.cesga.es   BIP   0/8       8.83     lx26-x86
>> diligent at wn007.egee.cesga.es   BIP   0/8       6.97     lx26-x86
>> diligent at wn008.egee.cesga.es   BIP   0/8       5.99     lx26-x86
>> diligent at wn009.egee.cesga.es   BIP   0/8       7.92     lx26-x86
>> diligent at wn010.egee.cesga.es   BIP   0/8       5.99     lx26-x86
>> diligent at wn011.egee.cesga.es   BIP   0/8       6.99     lx26-x86
>> diligent at wn012.egee.cesga.es   BIP   0/8       6.98     lx26-x86
>> diligent at wn013.egee.cesga.es   BIP   0/8       5.96     lx26-x86
>> diligent at wn014.egee.cesga.es   BIP   0/8       -NA-     -NA-          au
>> dteam at wn001.egee.cesga.es      BIP   0/8       8.93     lx26-x86
>> dteam at wn002.egee.cesga.es      BIP   0/4       -NA-     lx26-x86      au
>> dteam at wn004.egee.cesga.es      BIP   0/8       8.83     lx26-x86
>> dteam at wn005.egee.cesga.es      BIP   0/8       -NA-     -NA-          au
>> dteam at wn006.egee.cesga.es      BIP   0/8       8.83     lx26-x86
>> dteam at wn007.egee.cesga.es      BIP   0/8       6.97     lx26-x86
>> dteam at wn008.egee.cesga.es      BIP   0/8       5.99     lx26-x86
>> dteam at wn009.egee.cesga.es      BIP   0/8       7.92     lx26-x86
>> dteam at wn010.egee.cesga.es      BIP   0/8       5.99     lx26-x86
>> dteam at wn011.egee.cesga.es      BIP   0/8       6.99     lx26-x86
>> dteam at wn012.egee.cesga.es      BIP   0/8       6.98     lx26-x86
>> dteam at wn013.egee.cesga.es      BIP   0/8       5.96     lx26-x86
>> dteam at wn014.egee.cesga.es      BIP   0/8       -NA-     -NA-          au
>> fusion at wn001.egee.cesga.es     BIP   0/8       8.93     lx26-x86
>> fusion at wn002.egee.cesga.es     BIP   0/4       -NA-     lx26-x86      au
>> fusion at wn004.egee.cesga.es     BIP   0/8       8.83     lx26-x86
>> fusion at wn005.egee.cesga.es     BIP   0/8       -NA-     -NA-          au
>> fusion at wn006.egee.cesga.es     BIP   0/8       8.83     lx26-x86
>> fusion at wn007.egee.cesga.es     BIP   0/8       6.97     lx26-x86
>> fusion at wn008.egee.cesga.es     BIP   0/8       5.99     lx26-x86
>> fusion at wn009.egee.cesga.es     BIP   0/8       7.92     lx26-x86
>> fusion at wn010.egee.cesga.es     BIP   0/8       5.99     lx26-x86
>> fusion at wn011.egee.cesga.es     BIP   0/8       6.99     lx26-x86
>> fusion at wn012.egee.cesga.es     BIP   0/8       6.98     lx26-x86
>> fusion at wn013.egee.cesga.es     BIP   0/8       5.96     lx26-x86
>> fusion at wn014.egee.cesga.es     BIP   0/8       -NA-     -NA-          au
>> imath at wn001.egee.cesga.es      BIP   0/8       8.93     lx26-x86
>> imath at wn002.egee.cesga.es      BIP   0/4       -NA-     lx26-x86      au
>> imath at wn004.egee.cesga.es      BIP   0/8       8.83     lx26-x86
>> imath at wn005.egee.cesga.es      BIP   0/8       -NA-     -NA-          au
>> imath at wn006.egee.cesga.es      BIP   0/8       8.83     lx26-x86
>> imath at wn007.egee.cesga.es      BIP   0/8       6.97     lx26-x86
>> imath at wn008.egee.cesga.es      BIP   0/8       5.99     lx26-x86
>> imath at wn009.egee.cesga.es      BIP   0/8       7.92     lx26-x86
>> imath at wn010.egee.cesga.es      BIP   0/8       5.99     lx26-x86
>> imath at wn011.egee.cesga.es      BIP   0/8       6.99     lx26-x86
>> imath at wn012.egee.cesga.es      BIP   0/8       6.98     lx26-x86
>> imath at wn013.egee.cesga.es      BIP   0/8       5.96     lx26-x86
>> imath at wn014.egee.cesga.es      BIP   0/8       -NA-     -NA-          au
>> lhcb at wn001.egee.cesga.es       BIP   0/8       8.93     lx26-x86
>> lhcb at wn002.egee.cesga.es       BIP   0/4       -NA-     lx26-x86      au
>> lhcb at wn004.egee.cesga.es       BIP   0/8       8.83     lx26-x86
>> lhcb at wn005.egee.cesga.es       BIP   0/8       -NA-     -NA-          au
>> lhcb at wn006.egee.cesga.es       BIP   0/9       8.83     lx26-x86
>> lhcb at wn007.egee.cesga.es       BIP   0/8       6.97     lx26-x86
>> lhcb at wn008.egee.cesga.es       BIP   0/8       5.99     lx26-x86
>> lhcb at wn009.egee.cesga.es       BIP   0/8       7.92     lx26-x86
>> lhcb at wn010.egee.cesga.es       BIP   0/8       5.99     lx26-x86
>> lhcb at wn011.egee.cesga.es       BIP   0/8       6.99     lx26-x86
>> lhcb at wn012.egee.cesga.es       BIP   0/8       6.98     lx26-x86
>> lhcb at wn013.egee.cesga.es       BIP   0/8       5.96     lx26-x86
>> lhcb at wn014.egee.cesga.es       BIP   0/8       -NA-     -NA-          au
>> ops at wn001.egee.cesga.es        BIP   0/9       8.93     lx26-x86
>> ops at wn002.egee.cesga.es        BIP   0/4       -NA-     lx26-x86      au
>> ops at wn004.egee.cesga.es        BIP   0/9       8.83     lx26-x86
>> ops at wn005.egee.cesga.es        BIP   0/8       -NA-     -NA-          au
>> ops at wn006.egee.cesga.es        BIP   0/9       8.83     lx26-x86
>> ops at wn007.egee.cesga.es        BIP   0/8       6.97     lx26-x86
>> ops at wn008.egee.cesga.es        BIP   0/8       5.99     lx26-x86
>> ops at wn009.egee.cesga.es        BIP   0/8       7.92     lx26-x86
>> ops at wn010.egee.cesga.es        BIP   0/8       5.99     lx26-x86
>> ops at wn011.egee.cesga.es        BIP   0/8       6.99     lx26-x86
>> ops at wn012.egee.cesga.es        BIP   0/8       6.98     lx26-x86
>> ops at wn013.egee.cesga.es        BIP   0/8       5.96     lx26-x86
>> ops at wn014.egee.cesga.es        BIP   0/8       -NA-     -NA-          au
>> swetest at wn001.egee.cesga.es    BIP   0/8       8.93     lx26-x86
>> swetest at wn002.egee.cesga.es    BIP   0/4       -NA-     lx26-x86      au
>> swetest at wn004.egee.cesga.es    BIP   0/8       8.83     lx26-x86
>> swetest at wn005.egee.cesga.es    BIP   0/8       -NA-     -NA-          au
>> swetest at wn006.egee.cesga.es    BIP   0/8       8.83     lx26-x86
>> swetest at wn007.egee.cesga.es    BIP   0/8       6.97     lx26-x86
>> swetest at wn008.egee.cesga.es    BIP   0/8       5.99     lx26-x86
>> swetest at wn009.egee.cesga.es    BIP   0/8       7.92     lx26-x86
>> swetest at wn010.egee.cesga.es    BIP   0/8       5.99     lx26-x86
>> swetest at wn011.egee.cesga.es    BIP   0/8       6.99     lx26-x86
>> swetest at wn012.egee.cesga.es    BIP   0/8       6.98     lx26-x86
>> swetest at wn013.egee.cesga.es    BIP   0/8       5.96     lx26-x86
>> swetest at wn014.egee.cesga.es    BIP   0/8       -NA-     -NA-          au
>>   29807 0.05479 STDIN      cesga050     qw    08/28/2008 11:51:22     1
>>   29808 0.05479 STDIN      cesga050     qw    08/28/2008 11:51:23     1
>>   29809 0.05477 STDIN      cesga050     qw    08/28/2008 11:51:25     1
>>   29810 0.05477 STDIN      cesga050     qw    08/28/2008 11:51:26     1
>>   29811 0.05476 STDIN      cesga050     qw    08/28/2008 11:51:27     1
>>   29812 0.05474 STDIN      cesga050     qw    08/28/2008 11:51:29     1
>>   29813 0.05474 STDIN      cesga050     qw    08/28/2008 11:51:30     1
>>   29814 0.05473 STDIN      cesga050     qw    08/28/2008 11:51:31     1
>>   29815 0.05471 STDIN      cesga050     qw    08/28/2008 11:51:34     1
>>   29817 0.05470 STDIN      cesga050     qw    08/28/2008 11:51:35     1
>>   29818 0.05469 STDIN      cesga050     qw    08/28/2008 11:51:37     1
>>   29819 0.05466 STDIN      cesga050     qw    08/28/2008 11:51:41     1
>>   29820 0.05465 STDIN      cesga050     qw    08/28/2008 11:51:42     1
>>   29821 0.05464 STDIN      cesga050     qw    08/28/2008 11:51:43     1
>>   29822 0.05464 STDIN      cesga050     qw    08/28/2008 11:51:44     1
>>   29823 0.05463 STDIN      cesga050     qw    08/28/2008 11:51:45     1
>>   29824 0.05463 STDIN      cesga050     qw    08/28/2008 11:51:45     1
>>
>> [root at ce2 ~]# qstat -f | grep ops
>> ops at wn001.egee.cesga.es        BIP   0/9       8.96     lx26-x86
>> ops at wn002.egee.cesga.es        BIP   0/4       -NA-     lx26-x86      au
>> ops at wn004.egee.cesga.es        BIP   0/9       8.85     lx26-x86
>> ops at wn005.egee.cesga.es        BIP   0/8       -NA-     -NA-          au
>> ops at wn006.egee.cesga.es        BIP   0/9       8.85     lx26-x86
>> ops at wn007.egee.cesga.es        BIP   0/8       6.99     lx26-x86
>> ops at wn008.egee.cesga.es        BIP   0/8       5.99     lx26-x86
>> ops at wn009.egee.cesga.es        BIP   0/8       7.93     lx26-x86
>> ops at wn010.egee.cesga.es        BIP   0/8       5.99     lx26-x86
>> ops at wn011.egee.cesga.es        BIP   0/8       7.00     lx26-x86
>> ops at wn012.egee.cesga.es        BIP   0/8       6.98     lx26-x86
>> ops at wn013.egee.cesga.es        BIP   0/8       5.98     lx26-x86
>> ops at wn014.egee.cesga.es        BIP   0/8       -NA-     -NA-          au
>>   29825 0.06011 STDIN      opssgm004    qw    08/28/2008 12:02:24     1
>>
>> [root at ce2 ~]# qstat -j 29825
>> ==============================================================
>> job_number:                 29825
>> exec_file:                  job_scripts/29825
>> submission_time:            Thu Aug 28 12:02:24 2008
>> owner:                      opssgm004
>> uid:                        30113
>> group:                      opssgm
>> gid:                        30003
>> sge_o_home:                 /home/glite/opssgm004
>> sge_o_log_name:             opssgm004
>> sge_o_path:                 
>> /usr/local/sge/pro/bin/lx26-x86:/usr/kerberos/sbin:/usr/kerberos/bin:/opt/edg/bin:/opt/glite/bin:/opt/lcg/bin:/usr/java/jdk1.5.0_14/bin:/usr/local/sbin:/usr/local/bin:/sbin:/bin:/usr/sbin:/usr/bin:/usr/X11R6/bin:/opt/globus/bin:/root/bin 
>>
>> sge_o_shell:                /bin/bash
>> sge_o_workdir:              /home/glite/opssgm004
>> sge_o_host:                 ce2
>> account:                    sge
>> cwd:                        /home/glite/opssgm004
>> hard resource_list:         s_vmem=1G,num_proc=1
>> mail_list:                  opssgm004 at ce2.egee.cesga.es
>> notify:                     FALSE
>> job_name:                   STDIN
>> jobshare:                   0
>> hard_queue_list:            ops
>> shell_list:                 env_list:
>> script_file:                STDIN
>> scheduling info:            queue instance 
>> "alice at wn014.egee.cesga.es" dropped because it is temporarily not 
>> available
>>                             queue instance 
>> "alice at wn002.egee.cesga.es" dropped because it is temporarily not 
>> available
>>                             queue instance 
>> "alice at wn005.egee.cesga.es" dropped because it is temporarily not 
>> available
>>                             queue instance 
>> "biomed at wn014.egee.cesga.es" dropped because it is temporarily not 
>> available
>>                             queue instance 
>> "biomed at wn002.egee.cesga.es" dropped because it is temporarily not 
>> available
>>                             queue instance 
>> "biomed at wn005.egee.cesga.es" dropped because it is temporarily not 
>> available
>>                             queue instance 
>> "cesga at wn014.egee.cesga.es" dropped because it is temporarily not 
>> available
>>                             queue instance 
>> "cesga at wn002.egee.cesga.es" dropped because it is temporarily not 
>> available
>>                             queue instance 
>> "cesga at wn005.egee.cesga.es" dropped because it is temporarily not 
>> available
>>                             queue instance "cms at wn014.egee.cesga.es" 
>> dropped because it is temporarily not available
>>                             queue instance "cms at wn002.egee.cesga.es" 
>> dropped because it is temporarily not available
>>                             queue instance "cms at wn005.egee.cesga.es" 
>> dropped because it is temporarily not available
>>                             queue instance 
>> "compchem at wn014.egee.cesga.es" dropped because it is temporarily not 
>> available
>>                             queue instance 
>> "compchem at wn002.egee.cesga.es" dropped because it is temporarily not 
>> available
>>                             queue instance 
>> "compchem at wn005.egee.cesga.es" dropped because it is temporarily not 
>> available
>>                             queue instance 
>> "diligent at wn014.egee.cesga.es" dropped because it is temporarily not 
>> available
>>                             queue instance 
>> "diligent at wn002.egee.cesga.es" dropped because it is temporarily not 
>> available
>>                             queue instance 
>> "diligent at wn005.egee.cesga.es" dropped because it is temporarily not 
>> available
>>                             queue instance 
>> "dteam at wn014.egee.cesga.es" dropped because it is temporarily not 
>> available
>>                             queue instance 
>> "dteam at wn002.egee.cesga.es" dropped because it is temporarily not 
>> available
>>                             queue instance 
>> "dteam at wn005.egee.cesga.es" dropped because it is temporarily not 
>> available
>>                             queue instance 
>> "fusion at wn014.egee.cesga.es" dropped because it is temporarily not 
>> available
>>                             queue instance 
>> "fusion at wn002.egee.cesga.es" dropped because it is temporarily not 
>> available
>>                             queue instance 
>> "fusion at wn005.egee.cesga.es" dropped because it is temporarily not 
>> available
>>                             queue instance 
>> "imath at wn014.egee.cesga.es" dropped because it is temporarily not 
>> available
>>                             queue instance 
>> "imath at wn002.egee.cesga.es" dropped because it is temporarily not 
>> available
>>                             queue instance 
>> "imath at wn005.egee.cesga.es" dropped because it is temporarily not 
>> available
>>                             queue instance "lhcb at wn014.egee.cesga.es" 
>> dropped because it is temporarily not available
>>                             queue instance "lhcb at wn002.egee.cesga.es" 
>> dropped because it is temporarily not available
>>                             queue instance "lhcb at wn005.egee.cesga.es" 
>> dropped because it is temporarily not available
>>                             queue instance 
>> "swetest at wn014.egee.cesga.es" dropped because it is temporarily not 
>> available
>>                             queue instance 
>> "swetest at wn002.egee.cesga.es" dropped because it is temporarily not 
>> available
>>                             queue instance 
>> "swetest at wn005.egee.cesga.es" dropped because it is temporarily not 
>> available
>>                             queue instance "ops at wn014.egee.cesga.es" 
>> dropped because it is temporarily not available
>>                             queue instance "ops at wn002.egee.cesga.es" 
>> dropped because it is temporarily not available
>>                             queue instance "ops at wn005.egee.cesga.es" 
>> dropped because it is temporarily not available
>>                             queue instance 
>> "atlas at wn014.egee.cesga.es" dropped because it is temporarily not 
>> available
>>                             queue instance 
>> "atlas at wn002.egee.cesga.es" dropped because it is temporarily not 
>> available
>>                             queue instance 
>> "atlas at wn005.egee.cesga.es" dropped because it is temporarily not 
>> available
>>                             cannot run in queue "alice" because it is 
>> not contained in its hard queue list (-q)
>>                             cannot run in queue "biomed" because it 
>> is not contained in its hard queue list (-q)
>>                             cannot run in queue "cesga" because it is 
>> not contained in its hard queue list (-q)
>>                             cannot run in queue "cms" because it is 
>> not contained in its hard queue list (-q)
>>                             cannot run in queue "compchem" because it 
>> is not contained in its hard queue list (-q)
>>                             cannot run in queue "diligent" because it 
>> is not contained in its hard queue list (-q)
>>                             cannot run in queue "dteam" because it is 
>> not contained in its hard queue list (-q)
>>                             cannot run in queue "fusion" because it 
>> is not contained in its hard queue list (-q)
>>                             cannot run in queue "imath" because it is 
>> not contained in its hard queue list (-q)
>>                             cannot run in queue "lhcb" because it is 
>> not contained in its hard queue list (-q)
>>                             cannot run in queue "swetest" because it 
>> is not contained in its hard queue list (-q)
>>                             (-l num_proc=1,s_vmem=1G) cannot run at 
>> host "wn013.egee.cesga.es" because it offers only hc:num_proc=0.000000
>>                             (-l num_proc=1,s_vmem=1G) cannot run at 
>> host "wn008.egee.cesga.es" because it offers only hc:num_proc=0.000000
>>                             (-l num_proc=1,s_vmem=1G) cannot run at 
>> host "wn010.egee.cesga.es" because it offers only hc:num_proc=0.000000
>>                             (-l num_proc=1,s_vmem=1G) cannot run at 
>> host "wn012.egee.cesga.es" because it offers only hc:num_proc=0.000000
>>                             (-l num_proc=1,s_vmem=1G) cannot run at 
>> host "wn007.egee.cesga.es" because it offers only hc:num_proc=0.000000
>>                             (-l num_proc=1,s_vmem=1G) cannot run at 
>> host "wn011.egee.cesga.es" because it offers only hc:num_proc=0.000000
>>                             (-l num_proc=1,s_vmem=1G) cannot run at 
>> host "wn009.egee.cesga.es" because it offers only hc:num_proc=0.000000
>>                             (-l num_proc=1,s_vmem=1G) cannot run at 
>> host "wn006.egee.cesga.es" because it offers only hc:num_proc=0.000000
>>                             (-l num_proc=1,s_vmem=1G) cannot run at 
>> host "wn004.egee.cesga.es" because it offers only hc:num_proc=0.000000
>>                             (-l num_proc=1,s_vmem=1G) cannot run at 
>> host "wn001.egee.cesga.es" because it offers only hc:num_proc=0.000000
>>
>> {
>>    name         maxujobs
>>    description  NONE
>>    enabled      TRUE
>>    limit        users @ops to num_proc=10
>>    limit        users @opssgm to num_proc=10
>>    limit        users @dteam to num_proc=10
>>    limit        users @swetest to num_proc=2
>>    limit        users @cesga to num_proc=100
>>    limit        users @imath to num_proc=100
>>    limit        users @lhcb to num_proc=50
>>    limit        users @lhcbprd to num_proc=3
>>    limit        users @lhcbsgm to num_proc=20
>>    limit        users @compchem to num_proc=40
>>    limit        users @fusion to num_proc=20
>>    limit        users @biomed to num_proc=30
>>    limit        users @biomedsgm to num_proc=14
>>    limit        users @alice to num_proc=30
>>    limit        users @alicesgm to num_proc=4
>>    limit        users @atlas to num_proc=20
>>    limit        users @atlassgm to num_proc=3
>>    limit        users @cms to num_proc=10
>> }
>>
>>


---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
For additional commands, e-mail: users-help at gridengine.sunsource.net




More information about the gridengine-users mailing list