[GE users] Cant read usage file error

Paul Mitchell pmitchel at email.unc.edu
Wed Nov 17 21:52:48 GMT 2004


Hello All,
  Just an update here; with the help of a friend at UNC (who gave me quite
a few clues) I reloaded the master and client software (this time using
the same directory for both master and client, via NFS).  I also told the
master installer that the domains were not in the same DNS domain (though
they really are) and not to use a local spool directory.

Consequently, I'm a little further down the road.

I can, as myself and root, run a sucessfull qrsh command (hooray!^)

I then changed the ownership of $SGE_ROOT/utilbin/rshd to the local admin
user "grid" (which I defined in the install) and gave that user rwx
privilege.

I'm still running afoul of a usage problem, though:

qsub testls
Your job 10 ("testls") has been submitted.
$ qsub $SGE_ROOT/examples/jobs/simple.sh
Your job 11 ("simple.sh") has been submitted.

from $SGE_ROOT/default/spool/qmaster/messages:

11/17/2004 16:14:26|qmaster|bp01|W|job 10.1 failed on host
bp08.isis.unc.edu general opening input/output file because: can't read
usage file for job 10.1
11/17/2004 16:14:26|qmaster|bp01|W|rescheduling job 10.1
11/17/2004 16:25:41|qmaster|bp01|W|job 11.1 failed on host
bp08.isis.unc.edu general opening input/output file because: can't read
usage file for job 11.1
11/17/2004 16:25:41|qmaster|bp01|W|rescheduling job 11.1

qstat -f
queuename                      qtype used/tot. load_avg arch
states
----------------------------------------------------------------------------
all.q at bp08.isis.unc.edu        BIP   0/2       0.02     darwin

############################################################################
 - PENDING JOBS - PENDING JOBS - PENDING JOBS - PENDING JOBS - PENDING
JOBS
############################################################################
     10 0.00000 testls     pmitchel     Eqw   11/17/2004 16:14:11     1
     11 0.00000 simple.sh  pmitchel     Eqw   11/17/2004 16:25:27     1

qstat -j
Jobs dropped because of error state
        10,     11

sge_execd is running,

ps -auxww | grep sge
grid       532   0.0 -0.0    28576   1044  ??  S     3:43PM   0:00.53
/usr/local/sge/bin/darwin/sge_execd

I've downloaded the fs_epilog and fs_prolog shells from
http://gridengine.sunsource.net/project/gridengine/howto/filestaging/filestage6.html
and will try and get them working tomorrow. Perhaps therein lies the
solution.

Paul Mitchell

==============================================================================
	Paul Mitchell
	email: pmitchel at email.unc.edu
	phone: (919) 962-9778
	office: I have an office, room 14, Phillips Hall
==============================================================================





---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
For additional commands, e-mail: users-help at gridengine.sunsource.net




More information about the gridengine-users mailing list