[Moabusers] Moab keeps on trying after pbs_mom rejects.
Chris Samuel
csamuel at vpac.org
Sun Dec 3 22:09:58 MST 2006
On Thursday 23 November 2006 03:09, wightman wrote:
> Have a look at:
>
> http://www.clusterresources.com/products/mwm/docs/a.fparameters.shtml#nodefailurereservetime
>
> When Moab knows which node is causing problems this parameter will tell
> Moab to put a reservation on the node, thus taking it out of the pool of
> feasible nodes.
We've been using this successfully, but how do you tell it to mark a node back
online again afterwards, once the problems are fixed and the script no longer
returns the error message?
We've tried clearing the messages out of the mom using momctl, but Moab seems
to be caching them somewhere & we can't bring the nodes back again. :-(
Help!
cheers,
Chris
--
Christopher Samuel - (03)9925 4751 - VPAC Deputy Systems Manager
Victorian Partnership for Advanced Computing http://www.vpac.org/
Bldg 91, 110 Victoria Street, Carlton South, VIC 3053, Australia