Custom Query (431 matches)

Results (52 - 54 of 431)

Ticket Resolution Summary Owner Reporter
#507 fixed IZ2552: No core dump if SGE daemons crash when admin_user != "root" andreas
Description

[Imported from gridengine issuezilla http://gridengine.sunsource.net/issues/show_bug.cgi?id=2552]

   Issue #: 2552   Platform: All   Reporter: andreas (andreas)
   Component: gridengine   OS: All
   Subcomponent: kernel   Version: 6.1AR_snapshot3_6   CC: None defined
   Status: REOPENED   Priority: P2
   Resolution:   Issue type: DEFECT
     Target milestone: ---
   Assigned to: andreas (andreas)
   QA Contact: andreas
   URL:
   * Summary: No core dump if SGE daemons crash when admin_user != "root"
   Status whiteboard:
   Attachments:
   Date/filename:                                Description:                                                                          Submitted by:
   Fri Apr 11 08:10:00 -0700 2008: libcore.so.gz libcore.so for AMD64 Linux (application/x-gzip)                                       andreas
   Fri Apr 11 08:12:00 -0700 2008: libcore.c     Source code for libcore.so (text/plain)                                               andreas
   Mon Apr 28 04:00:00 -0700 2008: libcore.so.gz libcore.so for lx24-ia64 (application/x-gzip)                                         andreas
   Mon Apr 28 04:01:00 -0700 2008: libcore.so.gz libcore.so for lx24-x86 (text/plain)                                                  andreas
   Mon Apr 28 06:49:00 -0700 2008: 2552.diff     Proposed patch (maintrunk) (text/plain)                                               andreas
   Tue May 13 02:23:00 -0700 2008: build.sh      Build.sh that I used to build libcore.so from libcore.c attached earlier (text/plain) andreas
     Issue 2552 blocks:
   Votes for issue 2552:

   Opened: Thu Apr 10 02:51:00 -0700 2008 
------------------------


DESCRIPTION:
When SGE daemons crash no core file gets written if admin_user != "root" due to
security concerns.

WORKAROUND/FIX:
Under Solaris coreadm(1) can be used to give the kernel a waiver (per
process/globally) so that core files get written in this case.

Under Linux there are two means:
(1) For overriding it for all processes there is a

      # sysctl -w kernel.core_setuid_ok=1

    it is mentioned in

      http://kbase.redhat.com/faq/FAQ_49_3652.shtm

    for RHEL3, so I would assume it works in RHEL4 as well

(2) For overriding it individually there is the call

      prctl(PR_SET_DUMPABLE,1,42,42,42);

    since

      https://bugzilla.redhat.com/show_bug.cgi?id=104310

    treats it as a bug when this call is broken, I would assume one can rely on it

   ------- Additional comments from andreas Thu Apr 10 05:00:34 -0700 2008 -------
Use of

  prctl(PR_SET_DUMPABLE,1,42,42,42)

under Linux seems problematic, as it would be necessary to issue this prctl()
anew each time uid/euid changes:

  http://linux-documentation.com/en/man/man2/prctl.html

   ------- Additional comments from andreas Thu Apr 10 05:38:01 -0700 2008 -------
The best approach to address this issue is to have the documentation explain how
to still get the core file.

The plan is to add a troubleshooting section to the 6.2 Install Guide that
refers to coreadm(1M) and sysctl -w kernel.core_setuid_ok

   ------- Additional comments from andreas Fri Apr 11 08:07:50 -0700 2008 -------
As it turned out, e.g. RHEL4 no longer knows

# sysctl -w kernel.core_setuid_ok=1

so the only resort to get a core dump under Linux appears to be to issue

   prctl(PR_SET_DUMPABLE,1,42,42,42);

after each call to setuid(), seteuid(), setgid(), and setegid().
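
The rule stated above (re-issue the prctl() after every uid change) can be sketched as a small helper. This is only an illustration, not the patch attached to this issue: the function names switch_euid_keep_dumpable() and is_dumpable() are hypothetical, and the unused prctl() arguments are passed as 0 rather than 42.

```c
#define _GNU_SOURCE
#include <sys/prctl.h>
#include <sys/types.h>
#include <unistd.h>

/* Hypothetical helper: change the effective uid, then set the dumpable
 * flag again, since the kernel may clear it when the euid changes. */
int switch_euid_keep_dumpable(uid_t euid)
{
    if (seteuid(euid) != 0)
        return -1;
    return prctl(PR_SET_DUMPABLE, 1, 0, 0, 0);
}

/* Returns the current value of the dumpable flag, or -1 on error. */
int is_dumpable(void)
{
    return prctl(PR_GET_DUMPABLE, 0, 0, 0, 0);
}
```

The same pattern would have to be repeated for setuid(), setgid() and setegid(), which is what makes this approach awkward to apply by hand.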

As a workaround, using libcore.so via LD_PRELOAD turned out to solve the
issue. E.g. to apply it for sge_execd one must change in

   $SGE_ROOT/$SGE_CELL/common/sgeexecd

the line

    $bin_dir/sge_execd

which starts sge_execd, into

    env LD_PRELOAD=/path/to/libcore.so $bin_dir/sge_execd

After an execd restart, a core.<pid> file is written in the spool directory
$SGE_ROOT/$SGE_CELL/spool/<host>/ of this execd when it gets killed using

    # kill -SEGV <pid>

Note that LD_PRELOAD is inherited by the shepherd processes forked by such
an execd, but the jobs themselves will not have it in their environments unless
INHERIT_ENV=LD_PRELOAD is added to the execd_params section of the
cluster configuration sge_conf(5).
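
The attached libcore.c is not reproduced in this ticket, so the following is only a guess at how such a preload library can work: it interposes seteuid(), forwards to the real implementation found via dlsym(RTLD_NEXT, ...), and sets the dumpable flag again on success. Covering only seteuid() is a simplification made here; a complete library would presumably wrap setuid(), setgid() and setegid() the same way.

```c
#define _GNU_SOURCE
#include <dlfcn.h>
#include <errno.h>
#include <sys/prctl.h>
#include <sys/types.h>
#include <unistd.h>

/* Interposed seteuid(): forwards to the next definition in the symbol
 * lookup order (normally libc) and, on success, marks the process
 * dumpable again. The lazy lookup is not synchronized; concurrent first
 * calls would simply resolve the same address twice. */
int seteuid(uid_t euid)
{
    static int (*real_seteuid)(uid_t);

    if (real_seteuid == NULL) {
        real_seteuid = (int (*)(uid_t)) dlsym(RTLD_NEXT, "seteuid");
        if (real_seteuid == NULL) {
            errno = ENOSYS;   /* no underlying implementation found */
            return -1;
        }
    }

    int rc = real_seteuid(euid);
    if (rc == 0) {
        int saved_errno = errno;
        prctl(PR_SET_DUMPABLE, 1, 0, 0, 0);
        errno = saved_errno;  /* keep the caller's view of errno intact */
    }
    return rc;
}
```

Built as a shared object (for example with gcc -shared -fPIC, plus -ldl on older glibc versions), a file like this could be named in LD_PRELOAD in the same way as libcore.so above.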

   ------- Additional comments from andreas Fri Apr 11 08:10:12 -0700 2008 -------
Created an attachment (id=164)
libcore.so for AMD64 Linux

   ------- Additional comments from andreas Fri Apr 11 08:12:04 -0700 2008 -------
Created an attachment (id=165)
Source code for libcore.so

   ------- Additional comments from andreas Mon Apr 28 04:00:50 -0700 2008 -------
Created an attachment (id=166)
libcore.so for lx24-ia64

   ------- Additional comments from andreas Mon Apr 28 04:01:50 -0700 2008 -------
Created an attachment (id=167)
libcore.so for lx24-x86

   ------- Additional comments from andreas Mon Apr 28 06:49:52 -0700 2008 -------
Created an attachment (id=168)
Proposed patch (maintrunk)

   ------- Additional comments from andreas Wed Apr 30 06:47:05 -0700 2008 -------
Fixed in Maintrunk for Linux sge_execds.

   ------- Additional comments from andreas Tue May 13 02:23:05 -0700 2008 -------
Created an attachment (id=171)
Build.sh that I used to build libcore.so from libcore.c attached earlier
#529 fixed IZ2621: Need presubmission procedures andreas
Description

[Imported from gridengine issuezilla http://gridengine.sunsource.net/issues/show_bug.cgi?id=2621]

        Issue #:      2621             Platform:     All           Reporter: andreas (andreas)
       Component:     gridengine          OS:        All
     Subcomponent:    kernel           Version:      6.2              CC:    bbarth, reuti
        Status:       NEW              Priority:     P3
      Resolution:                     Issue type:    ENHANCEMENT
                                   Target milestone: ---
      Assigned to:    ernst (ernst)
      QA Contact:     andreas
          URL:        http://wiki.gridengine.info/wiki/index.php/PresubmissionProcedures
       * Summary:     Need presubmission procedures
   Status whiteboard:
      Attachments:

     Issue 2621 blocks:
   Votes for issue 2621:


   Opened: Thu Jun 19 08:23:00 -0700 2008 
------------------------


There is a need for a plug-in interface that allows jobs to be verified/modified
in accordance with site-specific needs:

- the idea is to implement a general-purpose shared library plug-in interface
- the plug-in would facilitate customized job verification/modification
- SGE itself, however, would contain only a plug-in that forks/execs/waits for a
script-based plug-in
- the need for the better-performing general-purpose shared library plug-in
interface actually exists only at sites where huge numbers of jobs pass through
SGE every day
- for load balancing, a qsub/qrsh/qsh/qlogin/qmon plug-in would be ideal
- for maximum control, a qmaster plug-in would be ideal
- the conflicting objectives (load balancing vs. maximum control) suggest that
both qmaster and qsub/qrsh/qsh/qlogin/qmon should get a plug-in interface
- qalter must not be usable to bypass policy implemented via plug-ins
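
The "forks/execs/waits a script-based plug-in" idea from the list above can be sketched as follows. This is not the interface that was eventually specified: the function name run_verifier() and the convention that exit status 0 means "accept the job" are assumptions made for illustration only.

```c
#include <errno.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/* Hypothetical helper: runs the verifier program given in argv and
 * returns its exit status, or -1 if it could not be forked or did not
 * exit normally. Exit status 0 is assumed to mean "accept the job". */
int run_verifier(char *const argv[])
{
    pid_t pid = fork();
    if (pid < 0)
        return -1;

    if (pid == 0) {
        execvp(argv[0], argv);
        _exit(127);               /* exec failed */
    }

    int status;
    while (waitpid(pid, &status, 0) < 0) {
        if (errno != EINTR)       /* retry only if interrupted by a signal */
            return -1;
    }
    return WIFEXITED(status) ? WEXITSTATUS(status) : -1;
}
```

One fork/exec per submitted job is the cost that the list attributes to the script-based approach, which is why a shared library interface is suggested for sites with very high job throughput.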

   ------- Additional comments from reuti Fri Jun 20 05:51:09 -0700 2008 -------
Can someone please mark http://gridengine.sunsource.net/issues/show_bug.cgi?id=2593 as a duplicate
of this? - Thx - Reuti

   ------- Additional comments from andreas Fri Jun 20 06:10:02 -0700 2008 -------
*** Issue 2593 has been marked as a duplicate of this issue. ***

   ------- Additional comments from andreas Fri Jun 20 06:38:59 -0700 2008 -------
Added URL to specification document that is in the works.

   ------- Additional comments from ernst Tue Sep 23 04:53:23 -0700 2008 -------
Today I sent a specification draft to the users mailing list (subject: job
submission verifier)

   ------- Additional comments from ernst Tue Jan 13 07:19:33 -0700 2009 -------
The script-based solution is implemented and the feature will be available with
6.2u2. If the performance of this solution becomes an issue, we could also
easily add support for shared library JSVs.
#535 fixed IZ2633: memory leak after sge_peopen() in AFS/DCE/KERBEROS code andreas
Description

[Imported from gridengine issuezilla http://gridengine.sunsource.net/issues/show_bug.cgi?id=2633]

        Issue #:      2633             Platform:     All      Reporter: andreas (andreas)
       Component:     gridengine          OS:        All
     Subcomponent:    qmaster          Version:      6.1u4       CC:    None defined
        Status:       NEW              Priority:     P3
      Resolution:                     Issue type:    DEFECT
                                   Target milestone: ---
      Assigned to:    ernst (ernst)
      QA Contact:     ernst
          URL:
       * Summary:     memory leak after sge_peopen() in AFS/DCE/KERBEROS code
   Status whiteboard:
      Attachments:

     Issue 2633 blocks:
   Votes for issue 2633:


   Opened: Wed Jun 25 08:46:00 -0700 2008 
------------------------


The AFS/DCE/KERBEROS code in libs/gdi/sge_security.c leaks memory.
Each time sge_peopen() is called to launch one of the script plug-in
procedures, sge_bin2string() allocates memory that is never free()'d.

I'm filing this bug against qmaster because some of the procedures are launched
by qmaster.
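
The general shape of such a leak and its fix can be sketched as below. The code in sge_security.c is not shown in this ticket, so encode_hex() and launch_with_credentials() are hypothetical stand-ins and do not reflect the real signatures of sge_bin2string() or sge_peopen().

```c
#include <stdio.h>
#include <stdlib.h>

/* Hypothetical stand-in for an encoder that returns a malloc()'d string
 * which the caller owns. */
char *encode_hex(const unsigned char *buf, size_t len)
{
    char *out = malloc(2 * len + 1);
    if (out == NULL)
        return NULL;
    for (size_t i = 0; i < len; i++)
        snprintf(out + 2 * i, 3, "%02x", buf[i]);
    out[2 * len] = '\0';
    return out;
}

/* Hypothetical caller: encodes the credentials, hands them to a launch
 * callback, and releases the buffer afterwards. */
int launch_with_credentials(const unsigned char *cred, size_t len,
                            int (*launch)(const char *encoded))
{
    char *encoded = encode_hex(cred, len);
    if (encoded == NULL)
        return -1;

    int rc = launch(encoded);
    free(encoded);   /* without this, every launch leaks one buffer */
    return rc;
}
```

In a long-running daemon such as qmaster, a per-launch leak of this kind accumulates for as long as the process stays up.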