[GE users] Department problem

Richard Ems r.ems at gmx.net
Sat Mar 11 19:25:54 GMT 2006


Hi list!

Does the following bug still exist?

The workaround posted by Andy below doesn't help me, since I need to
define "fshare" on the departments.

I'm also getting "has no permission for queue ...". Must users listed in a
DEPT ACL also be defined explicitly, or does "enforce_user auto" take care
of that?

Thanks, Richard

> Date: Fri, 10 Dec 2004 16:19:53 +0100 (MET)
> From: Andy Schwierskott <andy.schwierskott at sun.com>
> Content-Type: TEXT/PLAIN; charset=US-ASCII; format=flowed
> Subject: [GE users] Problem with department + all.q
> 
> 
> Franz,
> 
> The DEPT needs to be an access list as well. See access_list(5): a
> department list can also be used as an access list.
> 
> The problem is that Grid Engine shouldn't allow the config you tried (or
> Grid Engine should allow DEPTs to be used as access lists as well, which
> might be more intuitive).
> 
> Another problem is that "qsub -w v" indicates that it should work. That's
> another smaller bug.
> 
> Workaround: Define your DEPT as ACL as well.
> 
> Please file an issue in Issuezilla.
> 
> Andy
> 
>> Hi there!
>>
>> We have a problem and I hope someone can help us solve it. First, some
>> details about the versions and OS: we use gridengine 6 Update 1 and
>> RedHat 3.1 AS.
>> Now to the problem. It occurs when we submit jobs as a user who belongs
>> to a list of type department. When we submit a job as user c10253, we get
>> the following output:
>>
>> [c10253 at hc-ma simpleJob]$ qstat -j 372
>> job_number:                 372
>> exec_file:                  job_scripts/372
>> submission_time:            Mon Dec  6 16:27:59 2004
>> owner:                      c10253
>> uid:                        46682
>> group:                      c102
>> gid:                        114
>> sge_o_home:                 /home/c102/c10253
>> sge_o_log_name:             c10253
>> sge_o_path:                 /usr/site/hc/mpich/current/GNU/bin:/usr/site/hc/sge/current/bin/lx24-amd64:/usr/site/hc/bin:/usr/local/bin:/usr/bin/X11:/usr/bin:/bin:/home/c102/c10253/bin:/usr/site/bin::/usr/site/hc/pgi/5.2/linux86-64/5.2/bin:/home/c102/c10253/bin
>> sge_o_shell:                /bin/bash
>> sge_o_workdir:              /home/c102/c10253/tutorial/simpleJob
>> sge_o_host:                 hc-ma
>> account:                    sge
>> cwd:                        /home/c102/c10253/tutorial/simpleJob
>> path_aliases:               /tmp_mnt/ * * /
>> stderr_path_list:           error.dat
>> mail_options:               e
>> mail_list:                  xxxx at xxx.com
>> notify:                     FALSE
>> job_name:                   job
>> stdout_path_list:           output.dat
>> jobshare:                   0
>> hard_queue_list:            all.q
>> env_list:
>> script_file:                job
>> scheduling info:            has no permission for queue "all.q at xxx.com"
>>                           has no permission for queue "all.q at xxx.com"
>>                           has no permission for queue "all.q at xxx.com"
>>                           has no permission for queue "all.q at xxx.com"
>>                           has no permission for queue "all.q at xxx.com"
>>                           has no permission for queue "all.q at xxx.com"
>>                           has no permission for queue "all.q at xxx.com"
>>                           has no permission for queue "all.q at xxx.com"
>>                           has no permission for queue "all.q at xxx.com"
>>                           has no permission for queue "all.q at xxx.com"
>>                           has no permission for queue "all.q at xxx.com"
>>                           has no permission for queue "all.q at xxx.com"
>>                           has no permission for queue "all.q at xxx.com"
>>                           has no permission for queue "all.q at xxx.com"
>>                           has no permission for queue "all.q at xxx.com"
>>                           has no permission for queue "all.q at xxx.com"
>>                           has no permission for queue "all.q at xxx.com"
>>                           has no permission for queue "all.q at xxx.com"
>>                           has no permission for queue "all.q at xxx.com"
>>                           has no permission for queue "all.q at xxx.com"
>>                           has no permission for queue "all.q at xxx.com"
>>                           has no permission for queue "all.q at xxx.com"
>>                           has no permission for queue "all.q at xxx.com"
>>                           has no permission for queue "all.q at xxx.com"
>>                           has no permission for queue "all.q at xxx.com"
>>                           has no permission for queue "all.q at xxx.com"
>>                           has no permission for queue "all.q at xxx.com"
>>                           has no permission for queue "all.q at xxx.com"
>>                           has no permission for queue "all.q at xxx.com"
>>                           has no permission for queue "all.q at xxx.com"
>>                           has no permission for queue "all.q at xxx.com"
>>                           has no permission for queue "all.q at xxx.com"
>>                           has no permission for queue "all.q at xxx.com"
>>                           has no permission for queue "all.q at xxx.com"
>>                           has no permission for queue "all.q at xxx.com"
>>
>> Here are the configs of the user, all.q, and the group of user c10253:
>>
>>
>> [root at hc-ma root]# qconf -suser c10253
>>
>> name c10253
>> oticket 250
>> fshare 10
>> delete_time 0
>> default_project NONE
>>
>> [root at hc-ma root]# qconf -sq all.q
>> qname                 all.q
>> hostlist              @allhosts
>> seq_no                0
>> load_thresholds       np_load_avg=1.75
>> suspend_thresholds    NONE
>> nsuspend              1
>> suspend_interval      00:05:00
>> priority              0
>> min_cpu_interval      00:05:00
>> processors            UNDEFINED
>> qtype                 BATCH INTERACTIVE
>> ckpt_list             NONE
>> pe_list               mpich-sge mpich-2perhost mpich-1perhost \
>>                     mpich-roundrobin mpich-fillup
>> rerun                 FALSE
>> slots                 140,[hc-003.xxx.com=4],[hc-007.xxx.com=4], \
>>                     [hc-004.xxx.com=4],[hc-002.xxx.com=4], \
>>                     [hc-009.xxx.com=4],[hc-008.xxx.com=4], \
>>                     [hc-006.xxx.com=4],[hc-013.xxx.com=4], \
>>                     [hc-012.xxx.com=4],[hc-014.xxx.com=4], \
>>                     [hc-010.xxx.com=4],[hc-005.xxx.com=4], \
>>                     [hc-001.xxx.com=4],[hc-015.xxx.com=4], \
>>                     [hc-016.xxx.com=4],[hc-029.xxx.com=4], \
>>                     [hc-035.xxx.com=4],[hc-032.xxx.com=4], \
>>                     [hc-019.xxx.com=4],[hc-026.xxx.com=4], \
>>                     [hc-018.xxx.com=4],[hc-023.xxx.com=4], \
>>                     [hc-022.xxx.com=4],[hc-024.xxx.com=4], \
>>                     [hc-025.xxx.com=4],[hc-033.xxx.com=4], \
>>                     [hc-028.xxx.com=4],[hc-017.xxx.com=4], \
>>                     [hc-020.xxx.com=4],[hc-027.xxx.com=4], \
>>                     [hc-021.xxx.com=4],[hc-030.xxx.com=4], \
>>                     [hc-034.xxx.com=4],[hc-031.xxx.com=4], \
>>                     [hc-011.xxx.com=4]
>> tmpdir                /tmp
>> shell                 /bin/bash
>> prolog                NONE
>> epilog                NONE
>> shell_start_mode      posix_compliant
>> starter_method        NONE
>> suspend_method        NONE
>> resume_method         NONE
>> terminate_method      NONE
>> notify                00:00:60
>> owner_list            NONE
>> user_lists            gr_c706 gr_c102
>> xuser_lists           NONE
>> subordinate_list      NONE
>> complex_values        NONE
>> projects              NONE
>> xprojects             NONE
>> calendar              NONE
>> initial_state         default
>> s_rt                  INFINITY
>> h_rt                  INFINITY
>> s_cpu                 INFINITY
>> h_cpu                 INFINITY
>> s_fsize               INFINITY
>> h_fsize               INFINITY
>> s_data                INFINITY
>> h_data                INFINITY
>> s_stack               INFINITY
>> h_stack               INFINITY
>> s_core                INFINITY
>> h_core                INFINITY
>> s_rss                 INFINITY
>> h_rss                 INFINITY
>> s_vmem                INFINITY
>> h_vmem                INFINITY
>>
>> [root at hc-ma root]# qconf -su gr_c102
>> name    gr_c102
>> type    DEPT
>> fshare  0
>> oticket 0
>> entries c10253
>>
>> If we use an ACL instead of a department, we can submit the job without
>> any problems. So maybe someone can help us solve the problem.
>> Best Regards
>>
>> Franz
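
For reference, the workaround quoted above ("Define your DEPT as ACL as well")
might be applied roughly as follows. This is only a minimal sketch under
stated assumptions: the helper name add_acl_type and the temporary file path
are hypothetical, the combined type is assumed to be written as "ACL DEPT"
(check access_list(5) for the exact syntax on your version), and loading the
file with "qconf -Mu" assumes your qconf supports that option. Editing the
list interactively with "qconf -mu gr_c102" should achieve the same thing.

```shell
#!/bin/sh
# Hypothetical helper: reads a userset dump (as printed by "qconf -su")
# on stdin and rewrites a pure "type DEPT" line to "type ACL DEPT".
# All other lines, including fshare and oticket, pass through unchanged.
add_acl_type() {
    sed -E 's/^type[[:space:]]+DEPT[[:space:]]*$/type    ACL DEPT/'
}

# Dry run on the list shown in the thread (no Grid Engine needed):
printf 'name    gr_c102\ntype    DEPT\nfshare  0\noticket 0\nentries c10253\n' \
    | add_acl_type

# On a real cluster (requires manager privileges; paths are examples only):
#   qconf -su gr_c102 | add_acl_type > /tmp/gr_c102.userset
#   qconf -Mu /tmp/gr_c102.userset
```

Since the fshare and oticket lines are left as they are, the list should keep
its department share settings while also becoming usable in a queue's
user_lists.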
