[GE issues] [Issue 3249] New - PE mismatch between AR and the submitted job using it results in wrong allocation

reuti reuti at staff.uni-marburg.de
Fri Mar 12 20:59:50 GMT 2010


http://gridengine.sunsource.net/issues/show_bug.cgi?id=3249
                 Issue #|3249
                 Summary|PE mismatch between AR and the submitted job using it 
                        |results in wrong allocation
               Component|gridengine
                 Version|6.2u5
                Platform|Macintosh
                     URL|
              OS/Version|All
                  Status|NEW
       Status whiteboard|
                Keywords|
              Resolution|
              Issue type|DEFECT
                Priority|P3
            Subcomponent|scheduling
             Assigned to|andreas
             Reported by|reuti






------- Additional comments from reuti at sunsource.net Fri Mar 12 12:59:45 -0800 2010 -------
$ qrsub -pe mpich 4 -d 3600
Your advance reservation 75 has been granted
$ qrstat -ar 75
...
granted_slots_list             all.q at pc15370=2,all.q at pc15381=2
granted_parallel_environment   mpich slots 4

Then submitting a job with a different PE into this AR:

$ qsub -pe smp 2 -ar 75 test.sh
Your job 738 ("test.sh") has been submitted
$ qstat -g t
job-ID  prior   name       user         state submit/start at     queue                          master ja-task-ID 
------------------------------------------------------------------------------------------------------------------
    738 0.75500 test.sh    reuti        r     03/12/2010 21:53:08 all.q at pc15370                  SLAVE         
    738 0.75500 test.sh    reuti        r     03/12/2010 21:53:08 all.q at pc15381                  MASTER        
                                                                  all.q at pc15381                  SLAVE

PE mpich has allocation_rule $round_robin, while PE smp has allocation_rule $pe_slots

This mismatch results in confusion. The job should either:

- be rejected with something like: PE mismatch between AR and acutal request

- or the PE from the qsub request should be allocated inside the granted slots of the AR

------------------------------------------------------
http://gridengine.sunsource.net/ds/viewMessage.do?dsForumId=36&dsMessageId=248235

To unsubscribe from this discussion, e-mail: [issues-unsubscribe at gridengine.sunsource.net].



More information about the gridengine-users mailing list