Opened 7 years ago

Closed 7 years ago

#1443 closed defect (fixed)

execd crash sending mail

Reported by: dlove Owned by: Dave Love <d.love@…>
Priority: normal Milestone:
Component: sge Version: 8.1.2
Severity: minor Keywords:
Cc:

Description

Happening frequently with a particular set of jobs:

Program terminated with signal 7, Bus error.
#0  0x00002b223c9c2018 in __deregister_frame_info () from /lib64/libgcc_s.so.1
(gdb) bt
#0  0x00002b223c9c2018 in __deregister_frame_info () from /lib64/libgcc_s.so.1
#1  0x00002b223b935a44 in _dl_fini () from /lib64/ld-linux-x86-64.so.2
#2  0x00002b223cbf8515 in exit () from /lib64/libc.so.6
#3  0x000000000045b322 in sge_send_mail (progid=<value optimized out>, 
    mailer=0x2b223baca500 "/bin/mail", user=0x2b223baca200 "***", 
    host=0x2b223baca210 "***", subj=<value optimized out>, 
    buf=0x2b223babf800 "Job 166647 (chr1chunk19) Started\n User       = nmirza\n Queue      = serial\n Host       = node165\n Start Time = 01/01/2013 15:46:44", mailer_has_subj_line=1) at ../daemons/common/mail.c:288
#4  0x000000000045b9dd in cull_mail (progid=15, 
    user_list=<value optimized out>, 
    subj=0x2b223babf400 "Job 166647 (chr1chunk19) Started", 
    buf=0x2b223babf800 "Job 166647 (chr1chunk19) Started\n User       = nmirza\n Queue      = serial\n Host       = node165\n Start Time = 01/01/2013 15:46:44", mail_type=0x5617a1 "job start") at ../daemons/common/mail.c:113
#5  0x00000000004344b5 in sge_exec_job (ctx=0x2b223ba36000, 
    jep=<value optimized out>, jatep=0x2b223bac8700, petep=0x0, 
    err_str=0x7fffa3a30a30 "\200\207\254;\"+", err_length=256)
    at ../daemons/execd/exec_job.c:1781

Change History (1)

comment:1 Changed 7 years ago by Dave Love <d.love@…>

  • Owner set to Dave Love <d.love@…>
  • Resolution set to fixed
  • Status changed from new to closed

In 4419/sge:

Fix #1443: Duplicate string passed to parse_script_params

Note: See TracTickets for help on using tickets.