[GE issues] [Issue 3035] ENABLE_ADDGRP_KILL does not work for qrsh with command

joga Joachim.Gabler at sun.com
Tue May 19 09:57:16 BST 2009


http://gridengine.sunsource.net/issues/show_bug.cgi?id=3035



User joga changed the following:

                What    |Old value                 |New value
================================================================================
             Assigned to|pollinger                 |joga
--------------------------------------------------------------------------------




------- Additional comments from joga at sunsource.net Tue May 19 01:57:15 -0700 2009 -------
Evaluation

Can be reproduced with the following script:
#!/bin/sh
#$ -S /bin/sh

if [ $# -ne 1 ]; then
   echo "usage: $0 <sleep time>"
fi

SLEEP=$1

setpgrp /bin/sh -c "id -a ; /usr/bin/sleep $SLEEP" &
wait
exit 0

It's in daemons/shepherd/shepherd.c:
        if (first_kill == 0 || sig != SIGKILL || is_qrsh == false) {

Killing by additional group id is explicitly disabled for qrsh jobs with the first kill. 
A second kill operation will never be done when the job script exits within reasonable time on SIGKILL - which is to be expected.

Problem (why it probably has not been done) is, that the qrsh_starter (as well as rshd if we don't use builtin rsh_daemon) would get killed
as well.
But if the qrsh_starter is killed instead of exiting, the exit code of the command will get lost.

------------------------------------------------------
http://gridengine.sunsource.net/ds/viewMessage.do?dsForumId=36&dsMessageId=197331

To unsubscribe from this discussion, e-mail: [issues-unsubscribe at gridengine.sunsource.net].



More information about the gridengine-users mailing list