[GE users] Failed searching requested shell

Ron Chen ron_chen_123 at yahoo.com
Fri Apr 2 06:20:06 BST 2004


Hi,

I almost totally forgot what you asked a few months
ago (sorry)!

Please update me: does it happen randomly? What did
you set shell_start_mode to? And which shell was
requested in the failed jobs?
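
In case it helps while you look: the trace below shows the shepherd
calling execvp() on the job script itself, which is what I would expect
if shell_start_mode is unix_behavior (an assumption on my side until you
confirm). In that mode the kernel picks the interpreter from the
script's #! line, and execvp() can return "No such file or directory"
when that interpreter is missing on the exec host, even though the
script itself exists. Here is a minimal Python sketch of that effect,
assuming a Linux host. Nothing in it is SGE-specific, and the try_exec
helper and the interpreter paths are made up for illustration:

```python
import errno
import os
import stat
import subprocess
import tempfile


def try_exec(interpreter: str) -> str:
    """Run a throwaway script whose #! line names `interpreter`.

    Returns "ok" if the script ran, otherwise the errno name of the
    failed exec (for example "ENOENT"). Hypothetical helper, not SGE code.
    """
    with tempfile.TemporaryDirectory() as tmp:
        script = os.path.join(tmp, "job_script")
        # newline="" keeps the bytes exactly as written (no translation).
        with open(script, "w", newline="") as fh:
            fh.write(f"#!{interpreter}\nexit 0\n")
        os.chmod(script, stat.S_IRWXU)
        try:
            # Exec the script directly, so the kernel resolves the #! line.
            subprocess.run([script], check=True)
        except OSError as exc:
            return errno.errorcode.get(exc.errno, str(exc.errno))
        return "ok"


if __name__ == "__main__":
    # A valid shell, a missing shell, and a #! line ending in a DOS
    # carriage return (the kernel then looks for "/bin/sh\r").
    for interp in ("/bin/sh", "/nonexistent/shell", "/bin/sh\r"):
        print(repr(interp), "->", try_exec(interp))
```

If the last two cases report ENOENT on your exec hosts as well, it would
be worth checking the first line of the failing users' scripts, both for
a shell path that does not exist on sohvi02 and for DOS line endings.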

 -Ron


--- Frans.Raulo at nokia.com wrote:
> Hello!
> 
> I asked about this a few (?) months ago but got no
> answers. Users still run into this problem from time
> to time, and I can't understand it, because the
> shell(s) should be readily available and in the
> user's PATH. Or maybe I haven't understood this
> correctly. Could someone please point me in the
> right direction?
> 
> 
> 
> Job 191244 caused action: Job 191244 set to ERROR
>  User        = xxxxxxx	
>  Queue       = sohvi02.q
>  Host        = sohvi02.xxxxxxxxxx.xxx
>  Start Time  = <unknown>
>  End Time    = <unknown>
> failed searching requested shell:03/31/2004 15:35:33 [10166669:17608]: execvp(/misc/sge/5.3p2/default/spool/sohvi02/job_scripts/19124
> Shepherd trace:
> 03/31/2004 15:35:33 [21500:17605]: shepherd called with uid = 0, euid = 21500
> 03/31/2004 15:35:33 [21500:17605]: starting up 5.3p5
> 03/31/2004 15:35:33 [21500:17605]: setpgid(17605, 17605) returned 0
> 03/31/2004 15:35:33 [21500:17605]: no prolog script to start
> 03/31/2004 15:35:33 [21500:17605]: forked "job" with pid 17608
> 03/31/2004 15:35:33 [21500:17608]: pid=17608 pgrp=17608 sid=17608 old pgrp=17605 getlogin()=<no login set>
> 03/31/2004 15:35:33 [21500:17608]: setosjobid: uid = 0, euid = 21500
> 03/31/2004 15:35:33 [21500:17605]: child: job - pid: 17608
> 03/31/2004 15:35:33 [21500:17608]: RLIMIT_CPU setting: (soft 18446744073709551613 hard 18446744073709551613) resulting: (soft 18446744073709551613 hard 18446744073709551613)
> 03/31/2004 15:35:33 [21500:17608]: RLIMIT_FSIZE setting: (soft 18446744073709551613 hard 18446744073709551613) resulting: (soft 18446744073709551613 hard 18446744073709551613)
> 03/31/2004 15:35:33 [21500:17608]: RLIMIT_DATA setting: (soft 18446744073709551613 hard 18446744073709551613) resulting: (soft 18446744073709551613 hard 18446744073709551613)
> 03/31/2004 15:35:33 [21500:17608]: RLIMIT_STACK setting: (soft 18446744073709551613 hard 18446744073709551613) resulting: (soft 18446744073709551613 hard 18446744073709551613)
> 03/31/2004 15:35:33 [21500:17608]: RLIMIT_CORE setting: (soft 18446744073709551613 hard 18446744073709551613) resulting: (soft 18446744073709551613 hard 18446744073709551613)
> 03/31/2004 15:35:33 [21500:17608]: RLIMIT_VMEM setting: (soft 18446744073709551613 hard 18446744073709551613) resulting: (soft 18446744073709551613 hard 18446744073709551613)
> 03/31/2004 15:35:33 [10166669:17608]: closing all filedescriptors
> 03/31/2004 15:35:33 [10166669:17608]: further messages are in "error" and "trace"
> 03/31/2004 15:35:33 [10166669:17608]: execvp(/misc/sge/5.3p2/default/spool/sohvi02/job_scripts/191244, /misc/sge/5.3p2/default/spool/sohvi02/job_scripts/191244 OFDM_UWB_wideCh_1 OFDM_UWB110_4xOS.scf)
> 03/31/2004 15:35:33 [21500:17605]: wait3 returned 17608 (status: 6912; WIFSIGNALED: 0,  WIFEXITED: 1, WEXITSTATUS: 27)
> 03/31/2004 15:35:33 [21500:17605]: job exited with exit status 27
> 03/31/2004 15:35:33 [21500:17605]: reaped "job" with pid 17608
> 03/31/2004 15:35:33 [21500:17605]: job exited not due to signal
> 03/31/2004 15:35:33 [21500:17605]: now sending signal 9 to pid -17608
> 03/31/2004 15:35:33 [21500:17605]: job exited with status 27
> 03/31/2004 15:35:33 [21500:17605]: no tasker to notify
> 03/31/2004 15:35:33 [21500:17605]: failed starting job
> 03/31/2004 15:35:33 [21500:17605]: no epilog script to start
> 
> Shepherd error:
> 03/31/2004 15:35:33 [10166669:17608]: execvp(/misc/sge/5.3p2/default/spool/sohvi02/job_scripts/191244, /misc/sge/5.3p2/default/spool/sohvi02/job_scripts/191244 OFDM_UWB_wideCh_1 OFDM_UWB110_4xOS.scf) failed: No such file or directory
> 
> Shepherd pe_hostfile:
> sohvi02.xxxx.xxx 1 sohvi02.q UNDEFINED
> 
> 
> Frans Raulo
> Systems Specialist
> Engineering Tools & Platforms
> TP Product Creation Processes & Services
> Helsinki / Ruoholahti
> Tel. +358 50 4873312
> mailto:frans.raulo at nokia.com 
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
> For additional commands, e-mail: users-help at gridengine.sunsource.net
> 






