this probably isn't much help, but this is what the torque exit statuses mean:<br><br><br>#define JOB_EXEC_OK 0 /* job exec successful */<br>
#define JOB_EXEC_FAIL1 -1 /* job exec failed, before files, no retry */<br>#define JOB_EXEC_FAIL2 -2 /* job exec failed, after files, no retry */<br>#define JOB_EXEC_RETRY -3 /* job execution failed, do retry */<br>
#define JOB_EXEC_INITABT -4 /* job aborted on MOM initialization */<br>#define JOB_EXEC_INITRST -5 /* job aborted on MOM init, chkpt, no migrate */<br>#define JOB_EXEC_INITRMG -6 /* job aborted on MOM init, chkpt, ok migrate */<br>
#define JOB_EXEC_BADRESRT -7 /* job restart failed */<br>#define JOB_EXEC_CMDFAIL -8 /* exec() of user command failed */<br><br><br><br><div class="gmail_quote">On Tue, Apr 15, 2008 at 9:47 AM, SCIPIONI Roberto <<a href="mailto:SCIPIONI.Roberto@nims.go.jp">SCIPIONI.Roberto@nims.go.jp</a>> wrote:<br>
<blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">Hi there,<br>
<br>
<br>
I am using Torque in a Linux Cluster<br>
at times my jobs go into running status "R"<br>
but actually they dont run<br>
<br>
I am using LAM MPI and compiled with tm support<br>
<br>
when the jobs "runs" without running at the end of<br>
<br>
qstat -f<br>
<br>
<br>
gives<br>
<br>
<br>
exit status = -3<br>
<br>
Any idea what it means ?<br>
<br>
<br>
Roberto Scipioni<br>
ICYS Research Fellow<br>
ICYS Cluster Manager<br>
ICYS, NIMS<br>
Japan<br>
<br>
<br>
_______________________________________________<br>
torqueusers mailing list<br>
<a href="mailto:torqueusers@supercluster.org">torqueusers@supercluster.org</a><br>
<a href="http://www.supercluster.org/mailman/listinfo/torqueusers" target="_blank">http://www.supercluster.org/mailman/listinfo/torqueusers</a><br>
</blockquote></div><br>