Hi, Berit<br>
I had the same issue.As you can see when maui updates previous
jobs status using MStatUpdateActiveJobUsage() function segmentation
fault occurs.To resolve this try restartting the pbs server with option
'pbs_server -type cold' to remove previous jobs.Then start
maui.Remember to start maui after you have started the pbs server.If
the previous jobs does'nt get deleted with above option, try using
pbs_sched and remove all the jobs, then restart the server and then
start maui.<br>
<br>
Hope this will<br>
Regards--<br>
<span class="sg">
Rishi Pathak</span><br><br><div><span class="gmail_quote">On 12/13/06, <b class="gmail_sendername">Berit Hinnemann</b> <<a href="mailto:behi@topsoe.dk">behi@topsoe.dk</a>> wrote:</span><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">
<div>Hi all,<br>
<br>
I am new to installing Torque PBS and Maui. My system is a one
dual-processor dual-core server for testing purposes, where I try
things out before getting the actual cluster. I have installed both
Torque PBS and this seems to work fine. Then I installed Maui and used
the file maui.cfg as below, aside from telling that the queue system is
PBS I did not change anything.<br>
<br>
Now the behavior is that I can start the 'maui' demon, issue 'showq'
and see the queue, but when I submit a job, the maui demon seems to
stop by itself. Then, when I issue "showq" I get<br>
<br>
[behi@RHE4Server 1proc]$ showq<br>
ERROR: cannot send request to server localhost.localdomain:42559 (server may not be running)<br>
ERROR: cannot request service (status)<br>
<br>
I have appended the lines generated in maui.log below.<br>
The job runs fine and I can also submit several jobs, which are just
done in the order submitted. I can also restart maui and repeat this
procedure.<br>
<br>
Does anybody have an idea where I should be looking to figure out what
is wrong? I would be grateful on any hints on how to get started.<br>
Best, Berit<br>
<br><div>--------------------------------------<br>Berit Hinnemann<br>Research Scientist<br>Haldor Topsøe A/S<br>---------------------------------------<br>
-------------------------------------------------------------------------------------------------------------------------------------<br>
output from maui.log upon submitting a job<br>
12/13 16:23:35 INFO: scheduling complete. sleeping 30 seconds<br>
12/13 16:24:06 ServerProcessRequests()<br>
12/13 16:24:06 INFO: not rolling logs (585245 < 10000000)<br>
12/13 16:24:06 MResAdjust(NULL,0,0)<br>
12/13 16:24:06 MStatInitializeActiveSysUsage()<br>
12/13 16:24:06 MStatClearUsage([NONE],Active)<br>
12/13 16:24:06 ServerUpdate()<br>
12/13 16:24:06 MSysUpdateTime()<br>
12/13 16:24:06 INFO: starting iteration 7<br>
12/13 16:24:06 MRMGetInfo()<br>
12/13 16:24:06 MClusterClearUsage()<br>
12/13 16:24:06 MRMClusterQuery()<br>
12/13 16:24:06 MPBSClusterQuery(localhost.localdomain,RCount,SC)<br>
12/13 16:24:06 __MPBSGetNodeState(Name,State,PNode)<br>
12/13 16:24:06 INFO: PBS node localhost.localdomain set to state Busy (job-exclusive)<br>
12/13 16:24:06 INFO: node 'localhost.localdomain' changed states from Idle to Busy<br>
12/13 16:24:06 ALERT: unexpected node transition on node 'localhost.localdomain' Idle -> Busy<br>
12/13 16:24:06 MPBSNodeUpdate(localhost.localdomain,localhost.localdomain,Busy,localhost.localdomain)<br>
12/13 16:24:06 INFO: node localhost.localdomain has joblist
'0/10.localhost.localdomain, 1/10.localhost.localdomain,
2/10.localhost.localdomain, 3/10.localhost.localdomain'<br>
12/13 16:24:06 ALERT: cannot locate PBS job '10.localhost.localdomain' (running on node localhost.localdomain)<br>
12/13 16:24:06 ALERT: cannot locate PBS job '10.localhost.localdomain' (running on node localhost.localdomain)<br>
12/13 16:24:06 ALERT: cannot locate PBS job '10.localhost.localdomain' (running on node localhost.localdomain)<br>
12/13 16:24:06 ALERT: cannot locate PBS job '10.localhost.localdomain' (running on node localhost.localdomain)<br>
12/13 16:24:06 MPBSLoadQueueInfo(localhost.localdomain,localhost.localdomain,SC)<br>
12/13 16:24:06 INFO: queue 'batch' started state set to True<br>
12/13 16:24:06 INFO: class to node not mapping enabled for queue 'batch' adding class to all nodes<br>
12/13 16:24:06 INFO: 1 PBS resources detected on RM localhost.localdomain<br>
12/13 16:24:06 INFO: resources detected: 1<br>
12/13 16:24:06 MRMWorkloadQuery()<br>
12/13 16:24:06 MPBSWorkloadQuery(localhost.localdomain,JCount,SC)<br>
12/13 16:24:06 MPBSJobLoad(10,10.localhost.localdomain,J,TaskList,0)<br>
12/13 16:24:06 MReqCreate(10,SrcRQ,DstRQ,DoCreate)<br>
12/13 16:24:06 INFO: processing node request line '1:ppn=4'<br>
12/13 16:24:06 MJobSetCreds(10,behi,behi,)<br>
12/13 16:24:06 INFO: default QOS for job 10 set
to DEFAULT(0) (P:DEFAULT,U:[NONE],G:[NONE],A:[NONE],C:[NONE])<br>
12/13 16:24:06 INFO: default QOS for job 10 set
to DEFAULT(0) (P:DEFAULT,U:[NONE],G:[NONE],A:[NONE],C:[NONE])<br>
12/13 16:24:06 INFO: default QOS for job 10 set
to DEFAULT(0) (P:DEFAULT,U:[NONE],G:[NONE],A:[NONE],C:[NONE])<br>
12/13 16:24:06 MResJCreate(10,MNodeList,-00:00:10,ActiveJob,Res)<br>
12/13 16:24:06 MStatUpdateActiveJobUsage(10)<br>
---------------------------------------------------------------------------------------------------------------------------------------<br>
maui.cfg<br>
# maui.cfg 3.2.6p18<br>
<br>
SERVERHOST localhost.localdomain<br>
# primary admin must be first in list<br>
ADMIN1 root<br>
<br>
# Resource Manager Definition<br>
<br>
RMCFG[localhost.localdomain] TYPE=PBS<br>
<br>
# Allocation Manager Definition<br>
<br>
AMCFG[bank] TYPE=NONE<br>
<br>
# full parameter docs at <a href="http://supercluster.org/mauidocs/a.fparameters.html" target="_blank" onclick="return top.js.OpenExtLink(window,event,this)">http://supercluster.org/mauidocs/a.fparameters.html</a><br>
# use the 'schedctl -l' command to display current configuration<br>
<br>
RMPOLLINTERVAL 00:00:30<br>
<br>
SERVERPORT 42559<br>
SERVERMODE NORMAL<br>
<br>
# Admin: <a href="http://supercluster.org/mauidocs/a.esecurity.html" target="_blank" onclick="return top.js.OpenExtLink(window,event,this)">http://supercluster.org/mauidocs/a.esecurity.html</a><br>
<br>
<br>
LOGFILE maui.log<br>
LOGFILEMAXSIZE 10000000<br>
LOGLEVEL 3<br>
<br>
# Job Priority: <a href="http://supercluster.org/mauidocs/5.1jobprioritization.html" target="_blank" onclick="return top.js.OpenExtLink(window,event,this)">http://supercluster.org/mauidocs/5.1jobprioritization.html</a><br>
<br>
QUEUETIMEWEIGHT 1<br>
<br>
# FairShare: <a href="http://supercluster.org/mauidocs/6.3fairshare.html" target="_blank" onclick="return top.js.OpenExtLink(window,event,this)">http://supercluster.org/mauidocs/6.3fairshare.html</a><br>
<br>
#FSPOLICY PSDEDICATED<br>
#FSDEPTH 7<br>
#FSINTERVAL 86400<br>
#FSDECAY 0.80<br>
<br>
# Throttling Policies: <a href="http://supercluster.org/mauidocs/6.2throttlingpolicies.html" target="_blank" onclick="return top.js.OpenExtLink(window,event,this)">http://supercluster.org/mauidocs/6.2throttlingpolicies.html
</a><br>
<br>
# NONE SPECIFIED<br>
<br>
# Backfill: <a href="http://supercluster.org/mauidocs/8.2backfill.html" target="_blank" onclick="return top.js.OpenExtLink(window,event,this)">http://supercluster.org/mauidocs/8.2backfill.html</a><br>
<br>
BACKFILLPOLICY FIRSTFIT<br>
RESERVATIONPOLICY CURRENTHIGHEST<br>
<br>
# Node Allocation: <a href="http://supercluster.org/mauidocs/5.2nodeallocation.html" target="_blank" onclick="return top.js.OpenExtLink(window,event,this)">http://supercluster.org/mauidocs/5.2nodeallocation.html</a><br>
<br>
NODEALLOCATIONPOLICY MINRESOURCE<br>
<br>
# QOS: <a href="http://supercluster.org/mauidocs/7.3qos.html" target="_blank" onclick="return top.js.OpenExtLink(window,event,this)">http://supercluster.org/mauidocs/7.3qos.html</a><br>
<br>
# QOSCFG[hi] PRIORITY=100 XFTARGET=100 FLAGS=PREEMPTOR:IGNMAXJOB<br>
# QOSCFG[low] PRIORITY=-1000 FLAGS=PREEMPTEE<br>
<br>
# Standing Reservations: <a href="http://supercluster.org/mauidocs/7.1.3standingreservations.html" target="_blank" onclick="return top.js.OpenExtLink(window,event,this)">http://supercluster.org/mauidocs/7.1.3standingreservations.html
</a><br>
<br>
# SRSTARTTIME[test] 8:00:00<br>
# SRENDTIME[test] 17:00:00<br>
# SRDAYS[test] MON TUE WED THU FRI<br>
# SRTASKCOUNT[test] 20<br>
# SRMAXTIME[test] 0:30:00<br>
<br>
# Creds: <a href="http://supercluster.org/mauidocs/6.1fairnessoverview.html" target="_blank" onclick="return top.js.OpenExtLink(window,event,this)">http://supercluster.org/mauidocs/6.1fairnessoverview.html</a><br>
<br>
# USERCFG[DEFAULT] FSTARGET=25.0<br>
# USERCFG[john] PRIORITY=100 FSTARGET=10.0-<br>
# GROUPCFG[staff] PRIORITY=1000 QLIST=hi:low QDEF=hi<br>
# CLASSCFG[batch] FLAGS=PREEMPTEE<br>
# CLASSCFG[interactive] FLAGS=PREEMPTOR<br>
<br></div></div>
<br>_______________________________________________<br>mauiusers mailing list<br><a onclick="return top.js.OpenExtLink(window,event,this)" href="mailto:mauiusers@supercluster.org">mauiusers@supercluster.org</a><br><a onclick="return top.js.OpenExtLink(window,event,this)" href="http://www.supercluster.org/mailman/listinfo/mauiusers" target="_blank">
http://www.supercluster.org/mailman/listinfo/mauiusers</a><br><br><br></blockquote></div><br>