[GE users] exclude a host in qsub

Reuti reuti at staff.uni-marburg.de
Tue Dec 13 00:37:32 GMT 2005


Hi,

Am 13.12.2005 um 01:11 schrieb Joe Fu:

> Hi Charu
>
>
> The problem is the user has to have manager role to do that. The
> scenario is normal users run   thousand jobs during night time, and  
> they
> notice one machine eats job and they want to avoid the machine and IT
> support is not available during night.
>

we have this effect of a black hole in the cluster if for any  
reasons /tmp is full on a node. For this I installed the described  
load sensor from the Howto page and setup a load_threshold for  
tmpfree, which will send this machine to alarm state if less than 1  
GB is free. Maybe a similar setup could also detect your non-working  
node on it's own.

Cheers - Reuti

> Thx
> -Joe
>
> -----Original Message-----
> From: Charu.Chaubal at Sun.COM [mailto:Charu.Chaubal at Sun.COM]
> Sent: Monday, December 12, 2005 3:05 PM
> To: users at gridengine.sunsource.net
> Subject: Re: [GE users] exclude a host in qsub
>
> Hi Joe,
>
> You can disable all queue instances on that host:
>
> qmod -d '*@<hostname>'
>
> Then, no jobs will get submitted to that host; running jobs will be
> allowed to finish.
>
> Regards,
> 	Charu
>
>
>
> Joe Fu wrote On 12/12/05 14:35,:
>> Hi
>>
>> Lsf has some nice features like you can exlude a host(s) to bsub so
> jobs
>> won't go to the machines. Anyone has a good way to achieve the  
>> same in
>> GE?
>>
>> Thanks
>>
>> -Joe
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
>> For additional commands, e-mail: users-help at gridengine.sunsource.net
>>
>
> -- 
> ####################################################################
> # Charu V. Chaubal              # Phone: (650) 786-7672 (x87672)   #
> # Grid Computing Technologist   # Fax:   (650) 786-4591            #
> # Sun Microsystems, Inc.        # Email: charu.chaubal at sun.com     #
> ####################################################################
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
> For additional commands, e-mail: users-help at gridengine.sunsource.net
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
> For additional commands, e-mail: users-help at gridengine.sunsource.net
>


---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
For additional commands, e-mail: users-help at gridengine.sunsource.net




More information about the gridengine-users mailing list