[GE users] CPU limit in mpi jobs

Reuti reuti at staff.uni-marburg.de
Fri Jun 9 22:13:59 BST 2006


Hi again,

the CPU limit is working in principle, but for now there is a  
possible race condition:

http://gridengine.sunsource.net/issues/show_bug.cgi?id=1960

The job will disappear, but some slaves keep on running.


To the usage: the usage of a parallel job is working for me. Can you  
try after a normal finished job:

qacct -j <jobid>

which should show also one entry for each qrsh call.

-- Reuti


Am 08.06.2006 um 20:22 schrieb Rui Ramos:

> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
>
>
>  Hi all,
>
>  Well i've tried setting the tight mpi integration with no luck.  
> I've also follow the LAM/MPI integration and set it with tight  
> integration. It seems to work for some jobs still have to make some  
> more tests. Anyway ! the cpu limit of the mpi jobs is allways  
> 00:00:00.
>
>  Does anybody have CPU limits working with mpi jobs ?
>
>                                                    Apreciate any  
> help :)
>
> PS: Yes is a tight integration of the LAM/MPI like explained in  
> Reuti howto.
>
> On Fri, 2 Jun 2006 16:55:05 +0100
> Rui Ramos <rramos at iric.up.pt> wrote:
>
>>
>>  Well i guess i don't have the tight integration. I'm reading your  
>> howto and the symptoms are the ones referenced.
>>
>>     http://gridengine.sunsource.net/howto/mpich-integration.html
>>
>>                                                                   
>> Regards, going to try it out
>>
>> On Fri, 2 Jun 2006 17:40:50 +0200
>> Reuti <reuti at staff.uni-marburg.de> wrote:
>>
>>> Hi,
>>>
>>> Am 02.06.2006 um 17:39 schrieb Rui Ramos:
>>>
>>>>
>>>>  Hi all,
>>>>
>>>>  I've set CPU limits in some of my queues. But there is something
>>>> that worries me. When submitting an mpi job this CPU limit, is set
>>>> to each mpi instance or to the sum of the all instances ?
>>>>  Another thing is when doing a qstat i get
>>>>
>>>> usage    1:                 cpu=00:00:00, mem=0.00050 GBs,
>>>> io=0.00000, vmem=121.828M, maxvmem=121.828M
>>>>
>>>>  And the cpu time is allways 00:00:00. Is the CPU limit really
>>>> working with mpi jobs ?
>>>
>>> is it a Tightly Integrated setup?- Reuti
>>>
>>>>                                                    thanks in  
>>>> advance
>>>>
>> -- 
>> ============================================
>>  Rui Manuel dos Santos Ramos
>>
>>  Instituto de Recursos e Iniciativas Comuns
>>  Pra_a Gomes Teixeira, 4099-002 Porto, Portugal
>>
>>  phone : +351 223 401 571
>>  e-mail: rramos[at]iric.up.pt
>>     web: http://ruiramos.homeip.net
>> ============================================
>>
>>
>
>
> - --
> ============================================
>  Rui Manuel dos Santos Ramos
>
>  Instituto de Recursos e Iniciativas Comuns
>  Praca Gomes Teixeira, 4099-002 Porto, Portugal
>
>  phone : +351 223 401 571
>  e-mail: rramos[at]iric.up.pt
>     web: http://ruiramos.homeip.net
> ============================================
>
> -----BEGIN PGP SIGNATURE-----
> Version: GnuPG v1.4.2.2 (GNU/Linux)
>
> iQEVAwUBRIhqz71uR0bdnTWSAQIA3Qf/Xhh3qXS+tDaGNY4Jb3p7a1dBbiYeBk11
> qPDCrX31GxNndfE5H6TWrIZbXwk1eCQQud8eShOyFeEWJYx95J43uE46NL5L7rqZ
> IXh2ZgqyaB+aG8AUU3Q/B/TItZz3TfiJmyAQHFVPn1+chQtnGKbloOnk+Cf11Cp+
> u0bPe/hfeyRsTVP4UPGwCFO4B0Q9buanvPvwwvyPi2VNL6pINLc6ym54hQTubDqP
> 3pxzKCzCvs3BkFk3NpzQIXpNPRkEnFaQSXiDZi/5K4mEBhbi9PvJNfS6zej7NlTW
> dGSqcyMSgn3prjVF2RFpRrXWh2OsMndgt8sxkQ5KSQGIg4wCdpD+dQ==
> =q47k
> -----END PGP SIGNATURE-----
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
> For additional commands, e-mail: users-help at gridengine.sunsource.net

---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
For additional commands, e-mail: users-help at gridengine.sunsource.net




More information about the gridengine-users mailing list