Opened 11 years ago

Last modified 9 years ago

#562 new defect

IZ2695: on Linux, interactive jobs are not properly terminated when they exceed resource limits

Reported by: pollinger Owned by:
Priority: low Milestone:
Component: sge Version: 6.2
Severity: Keywords: Linux execution
Cc:

Description

[Imported from gridengine issuezilla http://gridengine.sunsource.net/issues/show_bug.cgi?id=2695]

        Issue #:      2695             Platform:     All      Reporter: pollinger (pollinger)
       Component:     gridengine          OS:        Linux
     Subcomponent:    execution        Version:      6.2         CC:    None defined
        Status:       NEW              Priority:     P4
      Resolution:                     Issue type:    DEFECT
                                   Target milestone: ---
      Assigned to:    pollinger (pollinger)
      QA Contact:     pollinger
          URL:
       * Summary:     on Linux, interactive jobs are not properly terminated when they exceed resource limits
   Status whiteboard:
      Attachments:

     Issue 2695 blocks:
   Votes for issue 2695:


   Opened: Thu Aug 21 05:05:00 -0700 2008 
------------------------


This qrsh should terminate after one minute:

submit_host% qrsh -l h_rt=0:1:0
exec_host> sh
exec_host> while test 1 = 1
> do
> date
> sleep 20
> done
Thu Aug 21 08:15:23 UTC 2008
Thu Aug 21 08:15:43 UTC 2008
Thu Aug 21 08:16:03 UTC 2008
Hangup
Thu Aug 21 08:16:06 UTC 2008
Thu Aug 21 08:16:26 UTC 2008
Thu Aug 21 08:16:46 UTC 2008
Thu Aug 21 08:17:06 UTC 2008

The QRLOGIN does not go away and the output continues.

To get rid of the job, one has to kill the qrsh command on the submit host, or
to run "qdel -f" and kill -9 the "sh" on the exec_host.

When there is no subshell created, the session gets killed after the expected time.

Change History (0)

Note: See TracTickets for help on using tickets.