[GE users] calendar and no h_rt request

murple andreas.kuntzagk at mdc-berlin.de
Thu Oct 21 07:11:37 BST 2010


reuti wrote:
> Am 20.10.2010 um 17:11 schrieb murple:
> 
>> You mean I should set the cluster wide default for h_rt to this value? It gives 
>> me "Complex "h_rt" cannot have a default value" or what is "default_duration"? 
>> and where do I set it?
> 
> $ qconf -msconf

This is already set:

kuntzagk at login2:~> qconf -ssconf
algorithm                         default
schedule_interval                 0:0:15
maxujobs                          0
queue_sort_method                 load
job_load_adjustments              np_load_avg=1.00
load_adjustment_decay_time        0:7:30
load_formula                      np_load_avg
schedd_job_info                   true
flush_submit_sec                  0
flush_finish_sec                  0
params                            none
reprioritize_interval             0:0:0
halftime                          168
usage_weight_list                 cpu=0.667000,mem=0.333000,io=0.000000
compensation_factor               0.500000
weight_user                       0.250000
weight_project                    0.250000
weight_department                 0.250000
weight_job                        0.250000
weight_tickets_functional         0
weight_tickets_share              10000
share_override_tickets            TRUE
share_functional_shares           TRUE
max_functional_jobs_to_schedule   200
report_pjob_tickets               TRUE
max_pending_tasks_per_job         50
halflife_decay_list               none
policy_hierarchy                  OFS
weight_ticket                     1.000000
weight_waiting_time               3600.000000
weight_deadline                   3600000.000000
weight_urgency                    0.100000
weight_priority                   1.000000
max_reservation                   20
default_duration                  99999:59:59

Anything else that should be set to make this work?

Andreas

> 
> near the end.
> 
> -- Reuti
> 
> 
>> regards,
>>
>> reuti wrote:
>>> Hi,
>>>
>>> Am 20.10.2010 um 16:15 schrieb murple:
>>>
>>>> to prepare for a planned downtime I created a calendar a added it to the queue.
>>>> Now when I submit a job without explicitly requesting h_rt I will not be started 
>>>> "due to a reservation" despite that the queues h_rt limit ends before the downtime.
>>>>
>>>> If I explicitely request h_rt (with the same value as queue limit) it works.
>>> yep, the "default_duration" will be applied when the reservation is checked, which is "infinity". You could lower it to a more feasible value (up to h_rt), which would also avoid wrong backfilling (SGE judges "infinity" being smaller than "infinity", and so jobs might slip in for backfilling although they shouldn't. I set it to 9999:99:99 to avoid this side effect).
>>>
>>> -- Reuti
>>>
>>>
>>>> This is 6.2 (u0)
>>>>
>>>> here my setup:
>>>>
>>>> kuntzagk at login2:~> qconf -sq test
>>>> qname                 test
>>>> hostlist              @nodes @nodes_v2
>>>> seq_no                20
>>>> load_thresholds       np_load_avg=1.75
>>>> suspend_thresholds    NONE
>>>> nsuspend              1
>>>> suspend_interval      00:05:00
>>>> priority              0
>>>> min_cpu_interval      00:05:00
>>>> processors            UNDEFINED
>>>> qtype                 BATCH
>>>> ckpt_list             NONE
>>>> pe_list               make orte smp
>>>> rerun                 TRUE
>>>> slots                 4,[@fastq_nodes=2],[@nodes_v2=8],[node001=8]
>>>> tmpdir                /tmp
>>>> shell                 /bin/csh
>>>> prolog                NONE
>>>> epilog                NONE
>>>> shell_start_mode      unix_behavior
>>>> starter_method        NONE
>>>> suspend_method        NONE
>>>> resume_method         NONE
>>>> terminate_method      NONE
>>>> notify                00:00:60
>>>> owner_list            NONE
>>>> user_lists            root
>>>> xuser_lists           NONE
>>>> subordinate_list      longrun
>>>> complex_values        NONE
>>>> projects              NONE
>>>> xprojects             NONE
>>>> calendar              downtime_04_11_2010
>>>> initial_state         default
>>>> s_rt                  96:00:00
>>>> h_rt                  96:00:00
>>>> s_cpu                 INFINITY
>>>> h_cpu                 INFINITY
>>>> s_fsize               INFINITY
>>>> h_fsize               INFINITY
>>>> s_data                INFINITY
>>>> h_data                INFINITY
>>>> s_stack               INFINITY
>>>> h_stack               256M
>>>> s_core                INFINITY
>>>> h_core                INFINITY
>>>> s_rss                 INFINITY
>>>> h_rss                 INFINITY
>>>> s_vmem                INFINITY
>>>> h_vmem                8G
>>>>
>>>> kuntzagk at login2:~> qconf -scal downtime_04_11_2010
>>>> calendar_name    downtime_04_11_2010
>>>> year             4.11.2010-5.11.2010=17:30-8:00=off
>>>> week             NONE
>>>>
>>>> kuntzagk at login2:~> qsub -q test  /opt/sge/examples/jobs/simple.sh
>>>> kuntzagk at login2:~> qstat -j
>>>> ..
>>>> (-l NONE) cannot run at host "node055" because for default request it offers 
>>>> only hc:h_vmem=536870912.000000
>>>>
>>>> kuntzagk at login2:~> qalter -l h_rt=96:00:00 /opt/sge/examples/jobs/simple.sh
>>>>
>>>> .. works
>>>>
>>>> regards, Andreas
>>>>
>>>> ------------------------------------------------------
>>>> http://gridengine.sunsource.net/ds/viewMessage.do?dsForumId=38&dsMessageId=288639
>>>>
>>>> To unsubscribe from this discussion, e-mail: [users-unsubscribe at gridengine.sunsource.net].
>>> ------------------------------------------------------
>>> http://gridengine.sunsource.net/ds/viewMessage.do?dsForumId=38&dsMessageId=288641
>>>
>>> To unsubscribe from this discussion, e-mail: [users-unsubscribe at gridengine.sunsource.net].
>> ------------------------------------------------------
>> http://gridengine.sunsource.net/ds/viewMessage.do?dsForumId=38&dsMessageId=288645
>>
>> To unsubscribe from this discussion, e-mail: [users-unsubscribe at gridengine.sunsource.net].
> 
> ------------------------------------------------------
> http://gridengine.sunsource.net/ds/viewMessage.do?dsForumId=38&dsMessageId=288647
> 
> To unsubscribe from this discussion, e-mail: [users-unsubscribe at gridengine.sunsource.net].

------------------------------------------------------
http://gridengine.sunsource.net/ds/viewMessage.do?dsForumId=38&dsMessageId=288825

To unsubscribe from this discussion, e-mail: [users-unsubscribe at gridengine.sunsource.net].



More information about the gridengine-users mailing list