[GE users] Advanced Reservation question

reuti reuti at staff.uni-marburg.de
Fri Mar 12 19:11:57 GMT 2010


Am 12.03.2010 um 19:54 schrieb mhanby:

> Hmm, next question.
>
> If I (the creator of the AR) submit a job requesting the  
> reservation, it works fine:
>
> [mikeh at cluster1]$ qsub -ar 15 qsub_lam_hello.sh
> Your job 110972 ("j_lam_hello") has been submitted
>
> However, if the other user on the access list for the AR tries to  
> submit a job:
>
> [jdoe at cluster1]$ qsub -ar 15 lammps-job.sh
> Unable to run job: error: no suitable queues.
> Exiting.

What is the state of the AR:

$ qrstat

I would assume it's in state "E" instead of "r" because of the  
following bug:

http://gridengine.sunsource.net/issues/show_bug.cgi?id=3227

-- Reuti


> $ qrstat -ar 15 |grep acl
> acl_list                       mikeh,jdoe
>
> According to the man file the user list is comma separated list of  
> UNIX users.
>
> Any suggestions?
>
> I'd prefer that the administrators create reservations for the users.
>
> Mike
> -----Original Message-----
> From: mhanby [mailto:mhanby at uab.edu]
> Sent: Friday, March 12, 2010 11:48 AM
> To: users at gridengine.sunsource.net
> Subject: RE: [GE users] Advanced Reservation question
>
> Thanks Reuti, that did the trick.
>
> AR is pretty darn cool!
>
> -----Original Message-----
> From: reuti [mailto:reuti at staff.uni-marburg.de]
> Sent: Friday, March 12, 2010 3:46 AM
> To: users at gridengine.sunsource.net
> Subject: Re: [GE users] Advanced Reservation question
>
> Hi,
>
> Am 12.03.2010 um 00:03 schrieb mhanby:
>
>> I'm just trying out reservations for the first time.
>>
>> GE 6.2u5
>>
>> I'm having trouble creating an advanced reservation for a user
>> requesting 8 nodes with 8 slots each:
>>
>> $ qrsub -q
>> '*@compute-0-1,*@compute-0-10,*@compute-0-12,*@compute-0-4,*@compute- 
>> 0
>> -6,*@compute-0-8,*@compute-0-9,*@compute-1-8' -u jdoe,mikeh -a
>> 201003120100 -e 201003211200 -m bea -M mhanby at uab.edu -N Brain01
>
> the syntax is like for a normal qsub.
>
> In the essence you need to request "-pe orte 64" or alike to reserve
> more than one slot, where the allocation rule is used to reserve the
> slots. What you then submit into this AR is unrelated, i.e. it can be
> serial job or one with a different allocation rule, which will then
> be used inside the granted slots *).
>
> -- Reuti
>
> *) At least it was the behavior when I checked it last. Now it seems
> to use always the allocation_rule of the PE of the reservation,
> although the output states:
>
> parallel environment:  smp range: 2
>
> with "allocation_rule $pe_slosts" and I get one slot on two machines.
> IMO it's a bug. It should either reject the job (no suitable queue in
> the AR, PE dont' match), or use the allocation_rule of the job inside
> the granted slots of the AR (of course with the possibility that the
> job can't be scheduled inside the reservation).
>
>
>> Your advance reservation 5 has been granted
>>
>> These are 8 nodes that are currently running jobs, but will be open
>> by the time the reservation starts
>>
>> $ qrstat -ar 5
>> --------------------------------------------------------------------- 
>> -
>> ----------
>> id                             5
>> name                           Brain01
>> owner                          mikeh
>> state                          W
>> start_time                     03/12/2010 01:00:00
>> end_time                       03/21/2010 12:00:00
>> duration                       226:00:00
>> submission_time                03/11/2010 16:53:14
>> group                          sge
>> account                        sge
>> granted_slots_list             all.q at compute-0-8.local=1
>> mail_options                   abe
>> mail_list                      root at localhost
>> acl_list                       jdoe,mikeh
>>
>> The output seems to indicate that the reservation is only being
>> created for one host and a single slot, instead of 8 hosts each
>> with 8 slots.
>>
>> Do I have the syntax wrong for GE6.2u5?
>>
>> Also, is there a way to modify an existing AR? I don't see a
>> 'qrmod' command.
>>
>>
>> =================================
>> Mike Hanby
>> mhanby at uab.edu
>> Information Systems Specialist II
>> IT HPCS / Research Computing
>>
>> ------------------------------------------------------
>> http://gridengine.sunsource.net/ds/viewMessage.do?
>> dsForumId=38&dsMessageId=248081
>>
>> To unsubscribe from this discussion, e-mail: [users-
>> unsubscribe at gridengine.sunsource.net].
>>
>
> ------------------------------------------------------
> http://gridengine.sunsource.net/ds/viewMessage.do? 
> dsForumId=38&dsMessageId=248136
>
> To unsubscribe from this discussion, e-mail: [users- 
> unsubscribe at gridengine.sunsource.net].
>
> ------------------------------------------------------
> http://gridengine.sunsource.net/ds/viewMessage.do? 
> dsForumId=38&dsMessageId=248191
>
> To unsubscribe from this discussion, e-mail: [users- 
> unsubscribe at gridengine.sunsource.net].
>
> ------------------------------------------------------
> http://gridengine.sunsource.net/ds/viewMessage.do? 
> dsForumId=38&dsMessageId=248203
>
> To unsubscribe from this discussion, e-mail: [users- 
> unsubscribe at gridengine.sunsource.net].
>

------------------------------------------------------
http://gridengine.sunsource.net/ds/viewMessage.do?dsForumId=38&dsMessageId=248207

To unsubscribe from this discussion, e-mail: [users-unsubscribe at gridengine.sunsource.net].



More information about the gridengine-users mailing list