[Moabusers] Moab keeps on trying after pbs_mom rejects.

Chris Samuel csamuel at vpac.org
Sun Dec 3 22:09:58 MST 2006


On Thursday 23 November 2006 03:09, wightman wrote:

> Have a look at:
>
> http://www.clusterresources.com/products/mwm/docs/a.fparameters.shtml#nodef
>ailurereservetime
>
> When Moab knows which node is causing problems this parameter will tell
> Moab to put a reservation on the node, thus taking it out of the pool of
> feasible nodes.

We've been using this successfully, but how do you tell it to mark a node back 
online again afterwards when the problems are fixed and the script no longer 
returns the error message ?

We've tried clearing the messages out of the mom using momctl, but Moab seems 
to be caching them somewhere & we can't bring the nodes back again. :-(

Help!

cheers,
Chris
-- 
 Christopher Samuel - (03)9925 4751 - VPAC Deputy Systems Manager
 Victorian Partnership for Advanced Computing http://www.vpac.org/
 Bldg 91, 110 Victoria Street, Carlton South, VIC 3053, Australia

-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
Url : http://www.supercluster.org/pipermail/moabusers/attachments/20061204/1baa0237/attachment.bin


More information about the moabusers mailing list