[GE users] sge_shepherd : free(): invalid pointer crash for more than 1032 slots

henk h.a.slim at durham.ac.uk
Thu May 13 15:15:37 BST 2010


On our system the gridengine 6.2u5 shepherd crashes for a simple
parallel job with 1040 slots. It is fine for 1032 slots (each server has
8 cores and I increment the job size by adding a server). I attach the
error file with a memory map. MPI is OpenMPI 1.4.1 and OS is SLES 11.0
AS a test I kept the 1032 slots on fixed servers and varied the server
that supplied the additional 8 slots, all giving this problem.
Is there some magcical number beyond 1032 that causes a problem for the
shepherd exe?

Thanks

Henk

------------------------------------------------------
http://gridengine.sunsource.net/ds/viewMessage.do?dsForumId=38&dsMessageId=257181

To unsubscribe from this discussion, e-mail: [users-unsubscribe at gridengine.sunsource.net].

    [ Part 2, "shepherd_crash_1040slots.e1284.txt"  Text/PLAIN (Name: ]
    [ "shepherd_crash_1040slots.e1284.txt") ~5.2 KB. ]
    [ Unable to print this part. ]



More information about the gridengine-users mailing list