[GE users] user configs disappear after qmaster restart (6.2u5).

ccaamad m.c.dixon at leeds.ac.uk
Sat May 15 13:21:01 BST 2010


Anyone seen this one before?

I've just restarted a 6.2u5 qmaster after patching its host and noticed 
that most of my users have vanished from "qconf -suserl". There were 104 
and there are now 17.

I'm using classic spooling to an NFS server and all 104 configuration 
files are still there under $SGE_ROOT/$SGE_CELL/spool/qmaster/users.

However, the qmaster messages file contains lots of scary things like:

05/15/2010 12:27:29|  main|sched1|E|unrecognized characters after the attribute values in line 12: "0.000000"
05/15/2010 12:27:29|  main|sched1|E|unrecognized characters after the attribute values in line 12: "cpu"
05/15/2010 12:27:29|  main|sched1|E|error reading file: "/services/sge/default/spool/qmaster/users/amtgwm"
05/15/2010 12:27:29|  main|sched1|E|unrecognized characters after the attribute values in line 12: "mem"
05/15/2010 12:27:29|  main|sched1|E|line 12 should begin with an attribute name
05/15/2010 12:27:29|  main|sched1|E|error reading file: "/services/sge/default/spool/qmaster/users/menkku"
05/15/2010 12:27:29|  main|sched1|E|error reading file: "/services/sge/default/spool/qmaster/users/pmis"
05/15/2010 12:27:29|  main|sched1|E|unrecognized characters after the attribute values in line 12: "mem"
05/15/2010 12:27:29|  main|sched1|E|line 12 should begin with an attribute name

Some of the objectional files don't really have much in them to object 
about:

$ cat -vE pmmgh
# Version: 6.2u5$
# $
# DO NOT MODIFY THIS FILE MANUALLY!$
# $
name pmmgh$
oticket 0$
fshare 0$
delete_time 0$
usage NONE$
usage_time_stamp 0$
long_term_usage NONE$
default_project ENG$


And some take quite a bit of reading, like the following, but don't 
immediately distinguish themselves to me from successfully-read ones:

$ cat -vE menkku
# Version: 6.2u5$
# $
# DO NOT MODIFY THIS FILE MANUALLY!$
# $
name menkku$
oticket 0$
fshare 0$
delete_time 0$
usage NONE$
usage_time_stamp 1271760722$
long_term_usage NONE$
project ENG cpu=733940.898122,mem=734057.904911,io=0.000000,iow=0.000000,vmem=128141781393.995209,maxvmem=128794461072.526474,submission_time=5657772816.237907,priority=0.000000,exit_status=185.512382,signal=326.949626,start_time=5657828277.780435,end_time=5657854733.696654,ru_wallclock=26455.916219,ru_utime=76141.137545,ru_stime=43.526298,ru_maxrss=0.000000,ru_ixrss=0.000000,ru_ismrss=0.000000,ru_idrss=0.000000,ru_isrss=0.000000,ru_minflt=3161731.764305,ru_majflt=472.276619,ru_nswap=0.000000,ru_inblock=0.000000,ru_oublock=0.000000,ru_msgsnd=0.000000,ru_msgrcv=0.000000,ru_nsignals=0.000000,ru_nvcsw=153945.149340,ru_nivcsw=873658.805998,acct_cpu=739721.396629,acct_mem=739839.871700,acct_io=0.000000,acct_iow=0.000000,acct_maxvmem=129715679204.997192,finished_jobs=0.000000 cpu=1202948.000000,mem=1334484.000000,io=0.000000,iow=0.000000,vmem=1032939606016.000000,maxvmem=1680405954560.000000,submission_time=65893467346.000000,priority=0.000000,exit_status=4635.000000,signal=18200.!
 000000,start_time=65893672177.000000,end_time=65893744716.000000,ru_wallclock=72539.000000,ru_utime=182045.514849,ru_stime=137.918007,ru_maxrss=0.000000,ru_ixrss=0.000000,ru_ismrss=0.000000,ru_idrss=0.000000,ru_isrss=0.000000,ru_minflt=11323678.000000,ru_majflt=2848.000000,ru_nswap=0.000000,ru_inblock=0.000000,ru_oublock=0.000000,ru_msgsnd=0.000000,ru_msgrcv=0.000000,ru_nsignals=0.000000,ru_nvcsw=473882.000000,ru_nivcsw=3012282.000000,acct_cpu=1202948.000000,acct_mem=1334484.000000,acct_io=0.000000,acct_iow=0.000000,acct_maxvmem=1680405954560.000000,finished_jobs=52.000000;$
default_project ENG$


I'm using SGE 6.2u5 on x86_64 RHEL5.

Any ideas?

Thanks,

Mark
-- 
-----------------------------------------------------------------
Mark Dixon                       Email    : m.c.dixon at leeds.ac.uk
HPC/Grid Systems Support         Tel (int): 35429
Information Systems Services     Tel (ext): +44(0)113 343 5429
University of Leeds, LS2 9JT, UK
-----------------------------------------------------------------

------------------------------------------------------
http://gridengine.sunsource.net/ds/viewMessage.do?dsForumId=38&dsMessageId=257387

To unsubscribe from this discussion, e-mail: [users-unsubscribe at gridengine.sunsource.net].



More information about the gridengine-users mailing list