[GE users] qrsh /bin/bash error mark all Queue to Error state

Reuti reuti at staff.uni-marburg.de
Thu Jul 3 16:21:15 BST 2008


Am 03.07.2008 um 17:16 schrieb Angel Arancibia:

> 2008/7/3 Reuti <reuti at staff.uni-marburg.de>:
>> Am 03.07.2008 um 16:55 schrieb Angel Arancibia:
>>
>>> 2008/7/3 Reuti <reuti at staff.uni-marburg.de>:
>>>>
>>>> Am 03.07.2008 um 14:50 schrieb Angel Arancibia:
>>>>
>>>>> <snip>
>>>>>  5480 ?        S      0:00 /home/sys/sge/bin/lx24-amd64/sge_execd
>>>>>  5500 ?        S      0:00  \_ /bin/sh -c
>>>>> /home/sys/sge/util/resources/loadsensors/misc_ifir.py
>>>>>  5501 ?        S      0:00  |   \_ /usr/bin/python -u
>>>>> /home/sys/sge/util/resources/loadsensors/misc_ifir.py
>>>>>  9372 ?        S      0:00  \_ sge_shepherd-9447 -bg
>>>>>  9373 ?        S<s    0:00      \_ sshd: aarancibia [priv]
>>>>>  9375 ?        S<     0:00          \_ sshd: aarancibia at pts/1
>>>>>  9376 pts/1    S<s    0:00              \_
>>>>> /home/sys/sge/utilbin/lx24-amd64/qrsh_starter /local/sys/sge/ 
>>>>> era-q8/a
>>>>>  9377 pts/1    S<     0:00                  \_ /bin/bash
>>>>>  9388 pts/1    R<+    0:00                      \_ ps -e f
>>>>
>>>> Did you define a priority for this queue below zero? This will  
>>>> set the
>>>> nice
>>>> value for the job, and user processes should only get 0 to 19.
>>
>> Can you then please post the last part of:
>>
>> ps -e f -o stat,nice,command
>>
>> For me I see the "<" only for processed having a nice value below  
>> zero.
>
> Yes, sory. There was the testing queue which has priority "-1", that
> why the "<". I know .... nice value below to 0 are for system process,
> but it was a testing.

Is it still putting the queues in E state (also when the priority is  
zero) when you close the window?

-- Reuti


> now,
>
> aarancibia at cluster:~$qrsh -q sistint /bin/bash
>
>
> aarancibia at era-q12:~$ps -e f
>  5426 ?        S      0:00 /home/sys/sge/bin/lx24-amd64/sge_execd
>  5430 ?        S      0:00  \_ /bin/sh -c
> /home/sys/sge/util/resources/loadsensors/misc_ifir.py
>  5432 ?        S      0:00  |   \_ /usr/bin/python -u
> /home/sys/sge/util/resources/loadsensors/misc_ifir.py
>  9261 ?        S      0:00  \_ sge_shepherd-9477 -bg
>  9262 ?        Ss     0:00      \_ sshd: aarancibia [priv]
>  9264 ?        S      0:00          \_ sshd: aarancibia at pts/0
>  9265 pts/0    Ss     0:00              \_
> /home/sys/sge/utilbin/lx24-amd64/qrsh_starter /local/sys/sge/era-q12/
>  9266 pts/0    S      0:00                  \_ /bin/bash
>  9277 pts/0    R+     0:00                      \_ ps -e f
>
> S      0 /home/sys/sge/bin/lx24-amd64/sge_execd
> S      0  \_ /bin/sh -c /home/sys/sge/util/resources/loadsensors/ 
> misc_ifir.py
> S      0  |   \_ /usr/bin/python -u
> /home/sys/sge/util/resources/loadsensors/misc_ifir.py
> S      0  \_ sge_shepherd-9477 -bg
> Ss     0      \_ sshd: aarancibia [priv]
> S      0          \_ sshd: aarancibia at pts/0
> Ss     0              \_ /home/sys/sge/utilbin/lx24-amd64/qrsh_starter
> /local/sys/sge/era-q12/active_jobs/9477.1
> S      0                  \_ /bin/bash
> R+     0                      \_ ps -e f -o stat,nice,command
>
> Thanks, and sory for the mistake.
>
> Angel
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
> For additional commands, e-mail: users-help at gridengine.sunsource.net


---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
For additional commands, e-mail: users-help at gridengine.sunsource.net




More information about the gridengine-users mailing list