[GE users] Mac OS X: unable to run job error

Ed Baskerville lists at edbaskerville.com
Thu Jun 1 21:12:08 BST 2006


Hm, those utils all seem to be returning the correct thing (both ip- 
 >name and name->ip). So it looks like this isn't a DNS problem.

As Chris mentioned, I've found an SGE setup I can use, so I will just  
give up on this for now.

--Ed

On Jun 1, 2006, at 6:53 AM, Rayson Ho wrote:

> (resending... don't know why it didn't get to the list :(  )
>
> You can also run install_execd on the master machine so that you can
> at least run something via SGE with a 1-node cluster.
>
> For hostname resolution, what do you see when you run "qhost"??
>
> Also, search in the "users" list mail archive for the usage of
> "gethostname" & "gethostbyname". Those utils are in
> $SGE_ROOT/utilbin/<arch>/ , and they use the same code to resolve for
> the hostname/IP.
>
> Rayson
>
>
> On 5/30/06, Ed Baskerville <lists at edbaskerville.com> wrote:
>> Hello,
>>
>> I'm trying to set up a test grid using two machines running Mac OS X:
>> a G5, which will act as both master and execution host, and a
>> PowerBook, which will just be an execution host.
>>
>> I went through the interactive install process: set up the qmaster
>> successfully, then set up two execution hosts. Everything looked good
>> until the very end of the execution host setup process, when I got a
>> message like this on the two machines:
>>
>> unable to resolve host 'astor'
>>
>> followed by a message that setup was successfully completed.
>>
>> I hoped for the best, but when I submitted a job I got this message:
>>
>> astor:/usr/local/gridengine/examples/jobs ebaskerv$ qsub simple.sh
>> Unable to run job: warning: ebaskerv your job is not allowed to run
>> in any queue
>> Your job 5 ("simple.sh") has been submitted.
>> Exiting.
>>
>> My pending jobs queue looks really nice now:
>>
>> astor:/usr/local/gridengine/examples/jobs ebaskerv$ qstat -f
>> ##################################################################### 
>> ###
>> ####
>> - PENDING JOBS - PENDING JOBS - PENDING JOBS - PENDING JOBS - PENDING
>> JOBS
>> ##################################################################### 
>> ###
>> ####
>>       1 0.55500 simple.sh  ebaskerv     qw    05/30/2006  
>> 02:02:10     1
>>       2 0.55500 simple.sh  ebaskerv     qw    05/30/2006  
>> 02:02:31     1
>>       3 0.55500 simple.sh  ebaskerv     qw    05/30/2006  
>> 02:02:56     1
>>       4 0.55500 simple.sh  ebaskerv     qw    05/30/2006  
>> 02:04:33     1
>>       5 0.55500 simple.sh  ebaskerv     qw    05/30/2006  
>> 12:11:17     1
>>
>> I strongly suspect this has to do with the fact that I don't have a
>> real DNS setup--I was hoping I could use .local Zeroconf/Bonjour (nee
>> Rendezvous) hostnames to identify the machines. I did have to do one
>> DNS hack during execution host setup: to get my PowerBook to be
>> recognized properly by the master, I had to add a mapping to /etc/
>> hosts from its IP address to its Zeroconf name darwin.local.
>>
>> All the nitty gritty details of my installation are here in several
>> blog entries, in case that helps:
>>
>> http://code.edbaskerville.com/
>>
>> Any help would be much appreciated! I'm coming from the world of
>> Xgrid, where things are less powerful, but also much less  
>> complicated.
>>
>> Thanks,
>> Ed
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
>> For additional commands, e-mail: users-help at gridengine.sunsource.net
>>
>>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
> For additional commands, e-mail: users-help at gridengine.sunsource.net
>

---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
For additional commands, e-mail: users-help at gridengine.sunsource.net




More information about the gridengine-users mailing list