[Moabusers] trouble setting node based triggers

Douglas Wightman wightman at clusterresources.com
Mon Aug 20 14:22:34 MDT 2007


I'm seeing that the trigger has fired: State is Successful.

Moab will send an email to the primary administrator, if this is root
then it will attempt to send an email to root at the localhost.

What email address did you want Moab to send the email to?

- Douglas

On Mon, 2007-08-20 at 15:58 -0400, Martins, Flavio wrote:
> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA256
> 
> Thanks for the info Douglas, that was a partial help. I see the triggers
> listed with mdiag -T now, but they are still not working right. I still
> don't see any gmetric entry in the mdiag -n output and my GLOBAL node
> trigger is not firing.
> 
> Here is the output from "checknode -v GLOBAL"
> 
> node GLOBAL
> 
> State:      Idle  (in current state for 1:53:28)
> Configured Resources:<license data> 
> Utilized   Resources:<license data>   
> Dedicated  Resources: ---
> Generic Metrics:  diskuse=0.55
>   MTBF(longterm):   INFINITY  MTBF(24h):   INFINITY
> Partition:  ALL  Rack/Slot:  ---
> Flags:      havegresavailinfo,rmdetected
> RM[ANSYSFlx TYPE=NATIVE:AGFULL
> NodeAccessPolicy: SHARED
> 
> Total Time: 2:14:02  Up: 2:14:02 (100.00%)  Active: 00:00:00 (0.00%)
> 
> Reservations:  ---
> TrigID               Object ID              Event  AType   ActionDate
> State
> - --------------------- -------------------- -------- ------
> - -----------------
> 33*                   node:GLOBAL          threshol  mail      -
> Successful
>   Launch Time: -1:53:28
>   BlockTime:    INFINITY  ActiveTime:  00:00:00
>   Threshold:   GMetric[diskuse] > 0.50
>   Action Data: Master node exceeded 0.5 diskusage
>   Variables=
> 
> * indicates trigger has completed
> 
> As you can see, checknode correctly shows gmetric[diskuse]=0.55 and yet
> the 
> Trigger for node GLOBAL does not fire for the condition gmetric > 0.5
> 
> Perhaps there is something unique about the "GLOBAL" node that is
> interfering here. All other nodes on the cluster have diskusage well
> below the 0.5 value. 
> 
> Flavio Martins
> Senior Engineer
> Aerodynamics / CFD
> Florida Turbine Technologies Inc.
> 100 Marquette Road
> Suite 110
> Jupiter, FL 33458-7101
> Phone: (561) 427-6261
> Fax: (561) 427-6191
> 
> - -----Original Message-----
> From: Douglas Wightman [mailto:wightman at clusterresources.com] 
> Sent: Monday, August 20, 2007 12:50 PM
> To: Martins, Flavio
> Cc: moabusers at supercluster.org
> Subject: Re: [Moabusers] trouble setting node based triggers
> 
> If you don't capitalize GMETRIC then this  will work:
> 
> Change:
> 
> NODECFG[DEFAULT] TRIGGER=atype=mail,action='Node $OID exceeded 0.5
> diskusage',etype=threshold,threshold=GMETRIC[diskuse]>0.5
> 
> to
> 
> NODECFG[DEFAULT] TRIGGER=atype=mail,action='Node $OID exceeded 0.5
> diskusage',etype=threshold,threshold=gmetric[diskuse]>0.5
> 
> And the triggers will show up in mdiag -T.
> 
> - - Douglas
> 
> 
> 
> On Fri, 2007-08-17 at 18:51 -0400, Martins, Flavio wrote:
> > I have been trying to set a node based trigger to e-mail me if disk
> > space runs low. I set up a native RM to query disk usage on my master
> > node and my compute nodes and report the usage number as a GMETRIC. I
> > then set a mail trigger to fire if disk usage surpassed a certain
> > percentage. The problem is that so far I have not been able to get
> > this to work. 
> > 
> >  
> > 
> > Some general observations:
> > 
> > The gmetric values shows up if I do a nodecheck, but not on mdiag -n. 
> > 
> > I have no listings from the mdiag -T command, so the triggers don't
> > seem to be picked up.
> > 
> > I don't see any errors or alerts in the moab.log file about the
> > triggers.
> > 
> >  
> > 
> > Here is how I tried to set it up:
> > 
> >  
> > 
> > Here is the native RM to get disk usage data - moab.cfg
> > 
> > RMCFG[disk] TYPE=NATIVE RESOURCETYPE=FS
> > 
> > RMCFG[disk] CLUSTERQUERYURL=exec:///opt/moab/tools/disk_check.pl 
> > 
> >  
> > 
> > Disk_check.pl produces the following output:
> > 
> > GLOBAL  GMETRIC[diskuse]=0.553221342092917
> > 
> > 0       GMETRIC[diskuse]=0.000442725244643413
> > 
> > 1       GMETRIC[diskuse]=0.000442725244643413
> > 
> > 2       GMETRIC[diskuse]=0.000499726374230815
> > 
> > 3       GMETRIC[diskuse]=0.000442725244643413
> > 
> > 4       GMETRIC[diskuse]=0.000442725244643413
> > 
> > 5       GMETRIC[diskuse]=0.334928463873042
> > 
> > 6       GMETRIC[diskuse]=0.228136833057353
> > 
> > (My master node is not available for running jobs, so I assigned its
> > disk usage to the global node for trigger setting purposes)
> > 
> >  
> > 
> > Then I set up my triggers based on the moab documentation found here:
> >
> http://www.clusterresources.com/products/mwm/docs/9.2accounting.shtml#gm
> etric
> > 
> > The example on this page is nearly identical to what I am trying to
> > do.
> > 
> >  
> > 
> > NODECFG[DEFAULT] TRIGGER=atype=mail,action='Node $OID exceeded 0.5
> > diskusage',etype=threshold,threshold=GMETRIC[diskuse]>0.5
> > 
> > NODECFG[GLOBAL]  TRIGGER=atype=mail,action='Master node exceeded 0.5
> > diskusage',etype=threshold,threshold=GMETRIC[diskuse]>0.5
> > 
> >  
> > 
> > The disk usage on the GLOBAL node is greater then 0.5 so the trigger
> > should fire.
> > 
> >  
> > 
> > Can anyone see anything wrong with this setup?
> > 
> >  
> > 
> > Flavio Martins
> > 
> > Senior Engineer - Aerodynamics / CFD
> > 
> > Florida Turbine Technologies Inc.
> > 
> > 100 Marquette Road, Suite 110
> > 
> > Jupiter, FL 33458-7101
> > 
> > -----------------------------------------------------
> > 
> > Phone: (561) 427-6261
> > 
> > Fax: (561) 427-6191
> > 
> >  
> > 
> > 
> > _______________________________________________
> > moabusers mailing list
> > moabusers at supercluster.org
> > http://www.supercluster.org/mailman/listinfo/moabusers
> 
> 
> -----BEGIN PGP SIGNATURE-----
> Version: PGP Universal 2.6.1
> Charset: us-ascii
> 
> wsBVAwUBRsnx3jxaeRbmFE+LAQibFggA6WUfSbI6su3rH2E+HkNID0UtGxclcaXq
> qYv8cE7sZ5RjHHge5k0ytnlfFYpky48fyrXhK5ahnHBFg+4WY/ZmebpaaI5H691R
> dgnsvPqUmAJek8nTdYptrueGfpDaZXRO97Bwxk7NFiY8k3qCN1zBSb2bQnl3d8zX
> rDDgfKYy6X2cHU02zVDrqTuc+K8Sc/Vg8pUxU2RTvcbsmy8OIrTYCfBrmrNIGalU
> Q1BMtwUxMsCpX1YzffVD1RJmudxQEI3RqJZa4QuD7R050IcTpQQJt0noydy3m1tq
> PlUNyQHUMvFdiVirrfHp4jV5Mghiwqv1aIpssr49Lpin33bm427SOQ==
> =VL8G
> -----END PGP SIGNATURE-----
> _______________________________________________
> moabusers mailing list
> moabusers at supercluster.org
> http://www.supercluster.org/mailman/listinfo/moabusers



More information about the moabusers mailing list