[GE users] prevent users from executing jobs on nodes except via sungrid

Jerry Mersel jerry.mersel at weizmann.ac.il
Tue Mar 28 13:26:36 BST 2006



Well, I definitely need tight integration, so I guess I'll
have to switch over to rsh.

                                Thanks,
                                 Jerry

> That means sshd is reading the default config file!
>
> So if you disable login for the system sshd, the one launched by
> SGE will also get the same configuration.
>
> The way to get this to work is to copy the configuration to
> another location, and then disable login in the default one.
> Then add "-f <path to the backup config file>" to rsh_daemon and
> rlogin_daemon in your cluster configuration:
>
> http://gridengine.sunsource.net/howto/qrsh_qlogin_ssh.html
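
The steps described above could be sketched roughly as follows. The file locations (/etc/ssh/sshd_config, the sshd_config.sge copy, /usr/sbin/sshd) and the helper name sge_daemon_lines are assumptions for illustration, and the -i flag is assumed from the usual inetd-style invocation, so check it against the howto linked above. The script only prints the two lines to paste into the cluster configuration (for example via qconf -mconf); it does not modify any system file.

```shell
#!/bin/sh
# Sketch only: every path below is an assumption, not taken from the thread.
SYSTEM_CONF="${SYSTEM_CONF:-/etc/ssh/sshd_config}"   # config read by the system sshd
SGE_CONF="${SGE_CONF:-/etc/ssh/sshd_config.sge}"     # hypothetical copy for SGE's sshd
SSHD_BIN="${SSHD_BIN:-/usr/sbin/sshd}"

# Print the cluster-configuration lines that point the sshd started by SGE
# at the given config file (sshd's -f option selects the config file).
sge_daemon_lines() {
    printf 'rsh_daemon     %s -i -f %s\n' "$SSHD_BIN" "$1"
    printf 'rlogin_daemon  %s -i -f %s\n' "$SSHD_BIN" "$1"
}

# Step 1 (as root, on each execution host): keep a permissive copy for SGE.
#   cp -p "$SYSTEM_CONF" "$SGE_CONF"
# Step 2: restrict logins in "$SYSTEM_CONF" only, then restart the system sshd.
# Step 3: paste the lines printed below into the cluster configuration.
sge_daemon_lines "$SGE_CONF"
```

Keeping the copy separate means later restrictions on the system sshd do not affect the daemon that SGE starts for qrsh and qlogin.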
>
> Note that when you use SSH, the PE integration is no longer tight.
> Tight SSH integration is enabled in maintrunk, and may be back-ported
> to V6.0:
>
> http://gridengine.sunsource.net/servlets/BrowseList?list=dev&by=thread&from=9051
>
>  -Ron
>
>
>
> --- Jerry Mersel <jerry.mersel at weizmann.ac.il> wrote:
>> Thank you for your quick response.
>>
>> I appear to be using sshd.
>> Next step?
>>
>>                   Thanks,
>>                    Jerry
>>
>> > Please re-enable normal user login and then, while it is enabled
>> > (so that parallel jobs do not fail), find out whether the PE uses
>> > sshd or rshd. Also find out whether it uses the system rshd or
>> > SGE's rshd.
>> >
>> > You can get that info by looking at the parent/child relationship of
>> > the slave MPI tasks.
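
One way to inspect that parent/child relationship is to walk from a slave task's PID up through its ancestors. The sketch below assumes a POSIX ps; the function name ancestry is made up for illustration.

```shell
#!/bin/sh
# Print a process and each of its ancestors, one per line:
# PID, parent PID, owner, command name.
ancestry() {
    pid=$1
    while [ -n "$pid" ] && [ "$pid" -gt 0 ]; do
        # Separate -o options keep every column header empty.
        ps -o pid= -o ppid= -o user= -o comm= -p "$pid" || break
        pid=$(ps -o ppid= -p "$pid" | tr -d ' ')
    done
}

# Default to the current shell when no PID is given.
ancestry "${1:-$$}"
```

Run on a slave node while the job is active, passing the PID of one MPI task (for instance one of the cpi processes from the job below). If the chain passes through sge_shepherd and sge_execd, the task was started by a daemon that SGE launched; if it leads straight from an sshd or rshd to init, the system daemon was used.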
>> >
>> > If you are using SGE rshd, and if you disable login by creating
>> > /etc/nologin, then follow this:
>> >
>> > http://gridengine.sunsource.net/servlets/ReadMsg?listName=users&msgNo=5023
>> >
>> > Otherwise, if it is because you are not using SGE's rshd, then tight
>> > integration is not configured correctly...
>> >
>> > Rayson
>> >
>> >
>> >
>> > On 3/27/06, Jerry Mersel <jerry.mersel at weizmann.ac.il> wrote:
>> >> I thought after setting up tight integration everything was working,
>> >> but I was mistaken.
>> >>
>> >>
>> >> When I run a parallel job with MPICH I still get errors in the stderr
>> >> output file such as:
>> >>
>> >> Child xxx exited without finalize.
>> >>
>> >> If I allow the user to log in without a password on the other nodes,
>> >> it works. But I only want root to log into the other nodes.
>> >>
>> >>
>> >> Here is the PE setup:
>> >>
>> >> pe_name           mpi
>> >> slots             999
>> >> user_lists        NONE
>> >> xuser_lists       NONE
>> >> start_proc_args   /home/mlmersel/mpi/startmpi.sh -catch_rsh $pe_hostfile
>> >> stop_proc_args    /home/mlmersel/mpi/stopmpi.sh
>> >> allocation_rule   $round_robin
>> >> control_slaves    TRUE
>> >> job_is_first_task FALSE
>> >> urgency_slots     min
>> >>
>> >>
>> >> # ---------------------------
>> >> # our name
>> >> #$ -N MPI_Job
>> >> #
>> >> # pe request
>> >> #$ -pe mpi 2-8
>> >> #
>> >> # MPIR_HOME from submitting environment
>> >> #$ -v MPIR_HOME
>> >> # ---------------------------
>> >>
>> >> Here is the script:
>> >>
>> >>
>> >> #
>> >> # needs in
>> >> #   $NSLOTS
>> >> #       the number of tasks to be used
>> >> #   $TMPDIR/machines
>> >> #       a valid machine file to be passed to mpirun
>> >>
>> >> echo "Got $NSLOTS slots."
>> >>
>> >> /usr/voltaire/mpi/bin/mpirun_ssh -np 2 -hostfile $TMPDIR/machines \
>> >>     /usr/voltaire/mpi/bin/cpi
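
The job above requests 2-8 slots but always starts two tasks. A variant that passes the granted slot count through could look like the sketch below. It reuses the mpirun_ssh options and paths from the script above; the run helper, the DRY_RUN switch, and the fallback values for NSLOTS and TMPDIR are assumptions added so the sketch can be tried outside a job.

```shell
#!/bin/sh
# Sketch: use the slot count granted by SGE instead of a fixed "-np 2".
NSLOTS="${NSLOTS:-2}"                      # set by SGE inside a parallel job
MACHINES="${TMPDIR:-/tmp}/machines"        # machine file written by startmpi.sh
MPIRUN="${MPIRUN:-/usr/voltaire/mpi/bin/mpirun_ssh}"
PROGRAM="${PROGRAM:-/usr/voltaire/mpi/bin/cpi}"
DRY_RUN="${DRY_RUN:-1}"                    # 1 = only print the command

# Either print the command line (dry run) or execute it.
run() {
    if [ "$DRY_RUN" = 1 ]; then
        echo "$@"
    else
        "$@"
    fi
}

echo "Got $NSLOTS slots."
run "$MPIRUN" -np "$NSLOTS" -hostfile "$MACHINES" "$PROGRAM"
```

Setting DRY_RUN=0 in the job environment makes the script launch the program instead of printing the command line.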
>> >>
>> >>
>> >> I usually load it using qmon with pe mpi 2-8.
>> >>
>> >>
>> >> I'm not sure how to solve this, so any help would be appreciated.
>> >>
>> >>
>> >>                              Thanks,
>> >>                                 Jerry
>> >
>> >
>> > ---------------------------------------------------------------------
>> > To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
>> > For additional commands, e-mail: users-help at gridengine.sunsource.net
>> >
>> >
>> >





