<div class="moz-text-flowed" style="font-family: -moz-fixed; font-size: 12px;" lang="x-western">Hi
<br>
<br>I submitted this job on the cluster and the job is deferred. Using
tracejob I get:
<br>
<br>03/14/2006 05:06:17 S unable to run job, MOM rejected/rc=1
<br>_
<br>Using checkjob $PBS_ID_
<br>StartDate: -00:06:36 Tue Mar 14 05:06:18
<br>Total Tasks: 1
<br>
<br>Req[0] TaskCount: 1 Partition: ALL
<br>Network: [NONE] Memory >= 0 Disk >= 0 Swap >= 0
<br>Opsys: [NONE] Arch: [NONE] Features: [NONE]
<br>
<br>
<br>IWD: [NONE] Executable: [NONE]
<br>Bypass: 0 StartCount: 2
<br>PartitionMask: [ALL]
<br>Flags: RESTARTABLE
<br>
<br>job is deferred. Reason: RMFailure (cannot start job - RM failure,
rc: 15041, msg: 'Execution server rejected request MSG=send failed,
STARTING')
<br>Holds: Defer (hold reason: RMFailure)
<br>PE: 1.00 StartPriority: 1
<br>cannot select job 99950 for partition DEFAULT (job hold active)
<br>
<br>Please advice
<br>
<br>Gaurav
<br>
<br></div>