[Moabusers] job disappeared from qstat and showq, still running on node
Pim Schravendijk
schraven at csi.tu-darmstadt.de
Mon Mar 21 18:12:03 MDT 2011
> If qstat isn't reporting the jobs (as you've reported)
> then Torque doesn't know about them.
>
> The pbs_mom might think they're there, but if the
> pbs_server has lost them then as far as it (and
> hence Moab) is concerned they've gone, and
> that's a bug in Torque.
Yes, this is completely correct. Updating Torque from 2.5.2 to 2.5.4
will be necessary.
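
For anyone who wants to check for this condition before upgrading, here
is a minimal Python sketch (not something that ships with Torque). It
compares the job IDs pbs_server reports in plain `qstat` output with the
ones a node's pbs_mom reports. It assumes the MOM diagnostics (for
example from `momctl -d 3 -h <node>`) list jobs on lines of the form
`job[<id>] ...`; that line format and all function names are my
assumptions, so adjust the patterns to what your version prints.

```python
import re

# Numeric part of a job ID, e.g. "1234" in "1234.headnode" or
# "1234[5]" in an array job ID (assumed ID format).
SEQ = r"\d+(?:\[\d*\])?"


def server_jobs(qstat_text):
    """Job sequence numbers pbs_server knows about (first qstat column)."""
    jobs = set()
    for line in qstat_text.splitlines():
        match = re.match(rf"({SEQ})\.", line.strip())
        if match:  # header and separator lines do not match
            jobs.add(match.group(1))
    return jobs


def mom_jobs(mom_text):
    """Job sequence numbers pbs_mom reports.

    Assumes diagnostic lines of the form "job[<id>] ..." (hypothetical
    format; check it against your own momctl output).
    """
    return set(re.findall(rf"job\[({SEQ})\.", mom_text))


def orphaned(qstat_text, mom_text):
    """Jobs the MOM still has but the server no longer lists."""
    return sorted(mom_jobs(mom_text) - server_jobs(qstat_text))
```

Feed it the captured text of both commands; anything returned by
orphaned() is a job still known to the node but missing from
pbs_server. Only the numeric part of the ID is compared, since qstat
may truncate the server name in its first column.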
On Tue, Mar 22, 2011 at 12:48 AM, Christopher Samuel
<samuel at unimelb.edu.au> wrote:
> On 22/03/11 10:11, Pim Schravendijk wrote:
>
>> Clearly, torque knows which jobs are running but
>> either doesn't tell this to moab, or moab doesn't
>> read it from torque.
>
> If qstat isn't reporting the jobs (as you've reported)
> then Torque doesn't know about them.
>
> The pbs_mom might think they're there, but if the
> pbs_server has lost them then as far as it (and
> hence Moab) is concerned they've gone, and
> that's a bug in Torque.
>
> Which version of Torque are you running?
>
> cheers!
> Chris
> --
> Christopher Samuel - Senior Systems Administrator
> VLSCI - Victorian Life Sciences Computation Initiative
> Email: samuel at unimelb.edu.au Phone: +61 (0)3 903 55545
> http://www.vlsci.unimelb.edu.au/