<HTML>
<HEAD>
<TITLE>Re: [torqueusers] simple (I hope) /etc/hosts question</TITLE>
</HEAD>
<BODY>
<FONT FACE="Verdana, Helvetica, Arial"><SPAN STYLE='font-size:14.0px'>In $PBS_HOME/mom_priv/config<BR>
<BR>
Add the line $pbsserver <name of host with pbs_server running><BR>
<BR>
<BR>
Jerry<BR>
<BR>
<BR>
<BR>
<BR>
Torque was really easy to install, but it seems like my /etc/hosts file must be screwed up, as I can't get the cluster nodes to respond. Specifically, within a cluster of 3 machines, each having an /etc/hosts file of: <BR>
<BR>
127.0.0.1 <a href="http://127.0.0.1"><http://127.0.0.1></a> localhost.localdomain localhost<BR>
199.17.152.17 <a href="http://199.17.152.17"><http://199.17.152.17></a> runner<BR>
199.17.152.135 <a href="http://199.17.152.135"><http://199.17.152.135></a> muscovey <BR>
199.17.152.13 <a href="http://199.17.152.13"><http://199.17.152.13></a> pekin<BR>
(( other workstations follow ))<BR>
<BR>
Now, when I have the pbs_server running on runner, and the pbs_mom daemons running on muscovey, pekin, and runner, I et the following status message, <BR>
<BR>
[root@runner torque-2.1.6]# pbsnodes -a<BR>
pekin<BR>
state = down<BR>
np = 1<BR>
ntype = cluster<BR>
<BR>
muscovey<BR>
state = down<BR>
np = 1<BR>
ntype = cluster <BR>
<BR>
runner<BR>
state = down <BR>
np = 1<BR>
ntype = cluster<BR>
<BR>
I realize this is a pretty low-level question, but what the heck is wrong with my /etc/hosts file?<BR>
<BR>
regards,<BR>
<BR>
NT<BR>
<BR>
<BR>
ps, the trouble shooting message given by torque is,<BR>
<BR>
[root@runner torque-2.1.6]# momctl -d 3<BR>
<BR>
Host: runner/runner Version: 2.1.6<BR>
WARNING: server not specified (set $pbsserver) <BR>
PID: 30531<BR>
HomeDirectory: /var/spool/torque/mom_priv<BR>
MOM active: 2518 seconds<BR>
Server Update Interval: 45 seconds<BR>
LOGLEVEL: 0 (use SIGUSR1/SIGUSR2 to adjust) <BR>
Communication Model: RPP<BR>
TCP Timeout: 20 seconds<BR>
NOTE: no prolog configured<BR>
Alarm Time: 0 of 10 seconds<BR>
Trusted Client List: 199.17.152.17 <a href="http://199.17.152.17"><http://199.17.152.17></a> ,127.0.0.1 <a href="http://127.0.0.1"><http://127.0.0.1></a> <BR>
Configured to use /usr/bin/scp -rpB<BR>
NOTE: no local jobs detected<BR>
<BR>
diagnostics complete<BR>
<BR>
<BR>
- - - - - - - - - - - - - - - - - - - - - <BR>
Nathan Moore<BR>
Assistant Professor, Physics<BR>
Winona State University<BR>
AIM: nmoorewsu <BR>
- - - - - - - - - - - - - - - - - - - - -<BR>
<HR ALIGN=CENTER SIZE="3" WIDTH="95%"></SPAN></FONT><SPAN STYLE='font-size:14.0px'><FONT FACE="Monaco, Courier New">_______________________________________________<BR>
torqueusers mailing list<BR>
torqueusers@supercluster.org<BR>
<a href="http://www.supercluster.org/mailman/listinfo/torqueusers">http://www.supercluster.org/mailman/listinfo/torqueusers</a><BR>
</FONT></SPAN>
</BODY>
</HTML>