[GE issues] [Issue 3035] ENABLE_ADDGRP_KILL does not work for qrsh with command

joga Joachim.Gabler at sun.com
Tue May 19 09:57:16 BST 2009


User joga changed the following:

                What    |Old value                 |New value
             Assigned to|pollinger                 |joga

------- Additional comments from joga at sunsource.net Tue May 19 01:57:15 -0700 2009 -------

Can be reproduced with the following script:
#$ -S /bin/sh

if [ $# -ne 1 ]; then
   echo "usage: $0 <sleep time>"


setpgrp /bin/sh -c "id -a ; /usr/bin/sleep $SLEEP" &
exit 0

It's in daemons/shepherd/shepherd.c:
        if (first_kill == 0 || sig != SIGKILL || is_qrsh == false) {

Killing by additional group id is explicitly disabled for qrsh jobs with the first kill. 
A second kill operation will never be done when the job script exits within reasonable time on SIGKILL - which is to be expected.

Problem (why it probably has not been done) is, that the qrsh_starter (as well as rshd if we don't use builtin rsh_daemon) would get killed
as well.
But if the qrsh_starter is killed instead of exiting, the exit code of the command will get lost.


To unsubscribe from this discussion, e-mail: [issues-unsubscribe at gridengine.sunsource.net].

More information about the gridengine-users mailing list