[GE users] SGE and OpenMPI 1.3.2

jess jac67 at georgetown.edu
Thu Jan 21 16:36:41 GMT 2010


Thanks to everyone who helped me out! It is working fine now. I will 
upgrade to 6.2u5 and OpenMPI 1.4.

Jess

On 01/21/2010 10:06 AM, templedf wrote:
> olesen wrote:
>    
>>> This is what I'm using for my openmpi PE for a reference:
>>>
>>> $ qconf -sp openmpi
>>> pe_name            openmpi
>>> slots              9999
>>> user_lists         NONE
>>> xuser_lists        NONE
>>> start_proc_args    /bin/true
>>> stop_proc_args     /bin/true
>>> allocation_rule    $fill_up
>>> control_slaves     TRUE
>>> job_is_first_task  FALSE
>>> urgency_slots      min
>>> accounting_summary FALSE
>>>
>>>        
>>
>> Is '/bin/true' correct? I have
>>
>>    start_proc_args    NONE
>>    stop_proc_args     NONE
>>
>>
>>      
> NONE is equivalent to /bin/true.  Both are just no-ops.
>
>    
>>> I haven't quite decided whether or not to use
>>> "accounting_summary TRUE" yet, as it doesn't seem to account
>>> properly for parallel jobs.
>>>
>>>        
>> I don't bother with accounting there either. Instead I parse the
>> accounting file and count the slots/walltime.
>> For our system the overall time that machines and licenses are occupied
>> is the primary accounting factor.
>>
>>      
> There was a bug in u4 that prevented correct accounting of PE jobs if
> accounting_summary was TRUE.  That's fixed in u5.  accounting_summary
> tells the qmaster whether to aggregate the accounting information for
> parallel jobs into a single accounting file entry.  If you're running
> huge parallel jobs, you want it set to TRUE.
>
> Daniel
>
>    
>> /mark
>>
>> This e-mail message and any attachments may contain legally privileged, confidential or proprietary Information, or information otherwise protected by law of EMCON Technologies, its affiliates, or third parties. This notice serves as marking of its "Confidential" status as defined in any confidentiality agreements concerning the sender and recipient. If you are not the intended recipient(s), or the employee or agent responsible for delivery of this message to the intended recipient(s), you are hereby notified that any dissemination, distribution or copying of this e-mail message is strictly prohibited.
>> If you have received this message in error, please immediately notify the sender and delete this e-mail message from your computer.
>>
>>
>>

------------------------------------------------------
http://gridengine.sunsource.net/ds/viewMessage.do?dsForumId=38&dsMessageId=240202

To unsubscribe from this discussion, e-mail: [users-unsubscribe at gridengine.sunsource.net].



More information about the gridengine-users mailing list