Opened 6 years ago

Closed 6 years ago

#1480 closed defect (fixed)

Prevent root-owned files in execd active_job spool area

Reported by: markdixon Owned by:
Priority: normal Milestone:
Component: sge Version: 8.1.5
Severity: minor Keywords:
Cc:

Description

The new cgroup/cpuset code uses a couple of routines for switching effective uid/gid which appear to be causing some problems.

Some of the side symptoms include the following files in the execd spool sometimes being owned by root:

active_jobs/<JID>.<TASK>/config
active_jobs/<JID>.<TASK>/environment
active_jobs/<JID>.<TASK>/pe_hostfile
active_jobs/<JID>.<TASK>/<NUM>.<HOST>/

That last entry is a directory created for a SLAVE task. It being root-owned can cause jobs to fail with a "can't open pid file" error message.

The execd appears to have the correct euid/egid when entering the cgroup code, so I have removed the offending function calls. I don't know if there's a good reason for them that I've not noticed in limited testing.

Potential patch attached.

Cheers,

Mark
--


Mark Dixon Email : m.c.dixon@…
HPC/Grid Systems Support Tel (int): 35429
Information Systems Services Tel (ext): +44(0)113 343 5429
University of Leeds, LS2 9JT, UK


0001-Prevent-root-owned-files-in-execd-active_job-spool-a.patch

Attachments (1)

0001-Prevent-root-owned-files-in-execd-active_job-spool-a.patch (1.7 KB) - added by markdixon 6 years ago.
Added by email2trac

Download all attachments as: .zip

Change History (2)

Changed 6 years ago by markdixon

Added by email2trac

comment:1 Changed 6 years ago by dlove

  • Resolution set to fixed
  • Status changed from new to closed

Fixed by [4640]

Note: See TracTickets for help on using tickets.