[GE users] [sge6_2u2] Failed to install separate Exec Host via both GUI and CLI

zhiqitao Zhiqi.Tao at sun.com
Tue Mar 17 11:22:34 GMT 2009


Dear All,

Could any of you advise whether I missed a step or a configuration
setting in my procedure?

I followed the installation guide on the sge6_2u2 wiki but had no
success installing a separate exec host. Although I managed to finish
the execd installation after copying the master's cell folder onto the
exec host, I doubt that was the correct approach.

Here is my procedure:

http://wikis.sun.com/display/gridengine62u2/Custom+Installation

1. Set up two CentOS 5.2 virtual machines (VMs): sms-server & sms-client

2. Configured passwordless ssh and rsh between the two VMs and from
each VM to itself.
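
Step 2 can be checked non-interactively before starting the installer.
The following is only a minimal sketch and not part of the original
procedure: check_passwordless and report_hosts are hypothetical helper
names, and the SSH variable is an assumption that lets another client
be substituted. It relies on the OpenSSH option BatchMode=yes, which
makes ssh fail instead of prompting for a password.

```shell
#!/bin/sh
# Hypothetical helper: succeeds only if the host accepts a login
# without prompting for a password or passphrase.
check_passwordless() {
    # BatchMode=yes disables prompts; ConnectTimeout bounds the wait.
    "${SSH:-ssh}" -o BatchMode=yes -o ConnectTimeout=5 "$1" true >/dev/null 2>&1
}

# Print one result line per host given as an argument.
report_hosts() {
    for host in "$@"; do
        if check_passwordless "$host"; then
            echo "$host: ok"
        else
            echo "$host: FAILED"
        fi
    done
}
```

Running "report_hosts sms-server sms-client" on each VM would cover both
directions as well as each host connecting to itself.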

3. Installed the latest Java SE Runtime Environment (JRE), downloaded from http://java.sun.com/javase/downloads/index.jsp
		jre-6u12-linux-x64-rpm.bin

4. Installed the SGE 6.2u2 rpm packages from http://www.sun.com/software/sge/get_it.jsp
  on both VMs
	sun-sge-bin-linux24-x64-6.2-2.x86_64.rpm
	sun-sge-common-6.2-2.noarch.rpm

5. Started the GUI installer
[root at sms-server ~]# cd /gridware/sge/
[root at sms-server sge]# . start_gui_installer
Starting Installer ...

Ticked sms-server as qmaster, execd and admin host.
Ticked sms-client as execd and admin host.

However, the sms-client installation failed. Please refer to the
screenshot at

http://picasaweb.google.com/zhiqi.tao/SGE62u2InstallationErrors

6. Verified the installation result of step 5.

[root at sms-server sge]# ps ax | grep sge
  788 ?        Sl     0:00 /gridware/sge/bin/lx24-amd64/sge_execd
1651 pts/3    S+     0:00 grep sge
32632 ?        Sl     0:00 /gridware/sge/bin/lx24-amd64/sge_qmaster
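
The check in step 6 can be scripted. This is a sketch under stated
assumptions: sge_daemon_status is a hypothetical helper name, and it
expects "ps ax" style output in which each daemon appears with its full
path, as in the listing above. Requiring the leading slash keeps a
"grep sge_execd" process from being counted as the daemon.

```shell
#!/bin/sh
# Hypothetical helper: reads `ps ax` style output on stdin and reports
# whether each Grid Engine daemon appears in it.
sge_daemon_status() {
    ps_output=$(cat)
    for daemon in sge_qmaster sge_execd; do
        # Match the daemon name only as the last component of a path.
        if printf '%s\n' "$ps_output" | grep -Eq "/$daemon( |\$)"; then
            echo "$daemon: running"
        else
            echo "$daemon: not running"
        fi
    done
}
```

It would be called as "ps ax | sge_daemon_status" on each host.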

7. Tried the CLI method.

http://wikis.sun.com/display/gridengine62u2/How+to+Install+Execution+Hosts

Verified that sms-client is on the administrative host list.
[root at sms-server sge]# qconf -sh
sms-client
sms-server
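
The same membership check can be done in a script rather than by eye.
A minimal sketch, where host_in_list is a hypothetical helper name; it
matches the whole line exactly, so a short name will not match a fully
qualified one.

```shell
#!/bin/sh
# Hypothetical helper: reads a host list (one name per line, as printed
# by `qconf -sh`) on stdin and succeeds if the given host is in it.
host_in_list() {
    # -F fixed string, -x whole line, -q no output
    grep -Fxq -- "$1"
}
```

For example: qconf -sh | host_in_list sms-client && echo "admin host".
The same helper should work on the output of "qconf -ss" (submit hosts).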

On sms-client:

[root at sms-client sge]# pwd
/gridware/sge
[root at sms-client sge]# ./install_execd



Grid Engine cells
-----------------

Please enter cell name which you used for the qmaster
installation or press <RETURN> to use [default] >>

Obviously there was no qmaster installation yet!
Call >install_qmaster<
on the machine which shall run the Grid Engine qmaster

[root at sms-client sge]# ls
3rd_party  doc       install_execd    man   start_gui_installer
bin        dtrace    install_qmaster  mpi   util
catman     examples  inst_sge         pvm   utilbin
ckpt       include   lib              qmon
[root at sms-client sge]#

Indeed, there is no default/common directory on sms-client.
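
A quick way to see what the exec host is missing before running
install_execd is sketched below. missing_cell_files is a hypothetical
helper name, and the three file names it checks are my assumption about
what a qmaster installation leaves in the cell's common directory, not
a list taken from the installer itself.

```shell
#!/bin/sh
# Hypothetical helper: prints the files missing from
# <sge_root>/<cell>/common. The cell name defaults to "default".
missing_cell_files() {
    sge_root=$1
    cell=${2:-default}
    # Assumed file names, not taken from the installer.
    for f in act_qmaster bootstrap settings.sh; do
        [ -f "$sge_root/$cell/common/$f" ] || echo "$cell/common/$f"
    done
}
```

On sms-client, "missing_cell_files /gridware/sge" would list all three
files at this point, since the default directory does not exist there.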

8. Manually copied sms-server:/gridware/sge/default/common to
sms-client and tried again.

On sms-client
[root at sms-client sge]# mkdir default
[root at sms-client sge]# cd default/
[root at sms-client default]# rcp -r sms-server:/gridware/sge/default/common .
[root at sms-client default]# ls
common

Restarted the GUI installer on sms-server:
[root at sms-server sge]# ./start_gui_installer -connect_user=root
Starting Installer ...

This time I installed only the execution host: sms-client.
All went well and the installation finished without errors.

[root at sms-client sge]# . /gridware/sge/default/common/settings.sh
[root at sms-client sge]# qconf -sh
sms-client
sms-server
[root at sms-client sge]# ps -u root|grep sge
16162 ?        00:00:00 sge_execd


Submitted a simple job on both hosts.
[root at sms-client sge]# qsub $SGE_ROOT/examples/jobs/simple.sh
Your job 1 ("simple.sh") has been submitted
[root at sms-client sge]# qstat
job-ID  prior   name       user         state submit/start at     queue                          slots ja-task-ID
-----------------------------------------------------------------------------------------------------------------
      1 0.00000 simple.sh  root         qw    03/17/2009 06:54:36                                    1
[root at sms-client sge]# qstat
job-ID  prior   name       user         state submit/start at     queue                          slots ja-task-ID
-----------------------------------------------------------------------------------------------------------------
      1 0.55500 simple.sh  root         r     03/17/2009 06:54:43 all.q at sms-client                   1
[root at sms-client sge]# qstat
job-ID  prior   name       user         state submit/start at     queue                          slots ja-task-ID
-----------------------------------------------------------------------------------------------------------------
      1 0.55500 simple.sh  root         r     03/17/2009 06:54:43 all.q at sms-client                   1
[root at sms-client sge]#
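
Rather than rerunning qstat by hand to watch the job move from qw to r,
the state column can be extracted with a small filter. A minimal sketch,
assuming the default qstat column layout shown above, in which the state
is the fifth whitespace-separated field; job_state is a hypothetical
helper name.

```shell
#!/bin/sh
# Hypothetical helper: reads default `qstat` output on stdin and prints
# the state column (for example qw or r) of the given job ID.
job_state() {
    # Header and separator lines never match a numeric job ID.
    awk -v id="$1" '$1 == id { print $5 }'
}
```

It would be used as "qstat | job_state 1". Empty output means the job is
not in the listing.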



[root at sms-server sge]# qconf -as sms-server
sms-server added to submit host list
[root at sms-server sge]# qsub $SGE_ROOT/examples/jobs/simple.sh
Your job 2 ("simple.sh") has been submitted
[root at sms-server sge]# qstat
job-ID  prior   name       user         state submit/start at     queue                          slots ja-task-ID
-----------------------------------------------------------------------------------------------------------------
      2 0.55500 simple.sh  root         r     03/17/2009 06:55:13 all.q at sms-client                   1


[root at sms-client ~]# cat simple.sh.o1
Tue Mar 17 06:54:43 EDT 2009
Tue Mar 17 06:55:03 EDT 2009
[root at sms-client ~]# cat simple.sh.o2
Tue Mar 17 06:55:13 EDT 2009
Tue Mar 17 06:55:33 EDT 2009


Thanks!

Zhiqi
