[GE users] berkeley checkpointing and matlab

Jerry Mersel jerry.mersel at weizmann.ac.il
Sun Jan 6 08:44:06 GMT 2008



Berkeley checkpointing and MATLAB work together from the command line for me.

However, when I try to use them with SGE, I get a "restart error".

Here is what I sent to the berkeley checkpoint people:


I manage to checkpoint MATLAB processes from the command line
(although I also get the relocation error there).
But when I want to use SGE I get these errors:
/lib64/libc.so.6: relocation error: /lib64/tls/libpthread.so.0: symbol errno, version GLIBC_PRIVATE not defined in file libc.so.6 with link time reference
Restart failed: No such device or address

I get the relocation error at startup, when launching with cr_run.
I get the "Restart failed" error when trying to restart.

I start MATLAB like this:
${BLCR_HOME}/bin/cr_run env LD_PRELOAD=libcr.so.0:libpthread.so.0 matlab -nojvm -nodisplay -nosplash < $H/test.m

and try to restart thus:
${BLCR_HOME}/bin/cr_restart $ckptfile

My log file says this:
Jan  2 14:24:36 kam02 kernel: Skipping a socket.
Jan  2 14:24:36 kam02 kernel: Skipping a socket.
Jan  2 14:26:03 kam02 kernel: Failed to open chrdev major=5 minor=0 path='/dev/tty')
Jan  2 14:26:03 kam02 kernel: cr_restore_all_files [28703]:  Unable to restore fd 3 (type=6,err=-6)
Jan  2 14:26:03 kam02 kernel: cr_rstrt_child [28703]:  Unable to restore files!  (err=-6)
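
The log names /dev/tty (character device major 5, minor 0), and err=-6 corresponds to ENXIO, "No such device or address", which is what Linux returns when /dev/tty is opened by a process with no controlling terminal. A batch job usually has none. As a rough diagnostic sketch (the function name has_ctty is made up for illustration and is not part of BLCR or SGE), this could be run in both environments to compare:

```shell
#!/bin/sh
# Hypothetical helper: report whether the current process has a
# controlling terminal. Opening /dev/tty fails when there is none.
has_ctty() {
    # The subshell keeps a failed redirection from aborting the script.
    if ( : < /dev/tty ) 2>/dev/null; then
        echo "yes"
    else
        echo "no"
    fi
}

echo "controlling tty: $(has_ctty)"
```

If this prints "yes" in the login shell and "no" inside the SGE job, the restart is probably failing on a terminal file descriptor that was saved in the checkpoint but cannot be reopened in the batch environment. This is only a guess from the log, not a confirmed diagnosis.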

Please help.

                               Regards,
                                  Jerry

