Dear all,<br><br>Apologies for the long message; I want to explain my problem clearly.<br><br>To work with OpenAFS and Kerberos, I am trying to use the SVN version of TORQUE with GSSAPI support. It compiles without errors.<br>
<br>When I start the server I see the following log:<br><br>Feb 27 12:33:25 v6-enmr PBS_Server: No such file or directory (2) in job_recov, Unable to read /var/spool/pbs/server_priv/jobs/5.v6-enmr.cerm.unifi.it.JB<br>Feb 27 12:33:25 v6-enmr PBS_Server: pbsd_init, Recover of job 5.v6-enmr.cerm.unifi.it.JB failed<br>
Feb 27 12:33:25 v6-enmr PBS_Server: Connection refused (111) in contact_sched, Could not contact Scheduler - port 15004 cannot bind to port 1023 in client_to_svr - connection refused<br>Feb 27 12:33:25 v6-enmr pbsserver: pbs_server startup succeeded<br>
Feb 27 12:33:25 v6-enmr su(pam_unix)[30128]: session opened for user maui by root(uid=0)<br>Feb 27 12:33:25 v6-enmr su(pam_unix)[30128]: session closed for user maui<br>Feb 27 12:33:25 v6-enmr su(pam_unix)[30131]: session opened for user maui by root(uid=0)<br>
Feb 27 12:33:26 v6-enmr su: INFO: starting Maui version 3.2.6p20 ##################<br>Feb 27 12:33:26 v6-enmr su: INFO: new LOGLEVEL value (3)<br>Feb 27 12:33:26 v6-enmr su: INFO: detected array index '0'<br>
Feb 27 12:33:26 v6-enmr su: MCfgProcessLine(RMHOST,0,<a href="http://v6-enmr.cerm.unifi.it">v6-enmr.cerm.unifi.it</a>)<br>Feb 27 12:33:26 v6-enmr su: MCfgProcessLine(RMPOLLINTERVAL,,00:00:10)<br>Feb 27 12:33:26 v6-enmr su: MCfgSetVal(RMPOLLINTERVAL,IVal,DVal,SVal,SArray,P)<br>
Feb 27 12:33:26 v6-enmr su: MUTimeFromString(00:00:10)<br>Feb 27 12:33:26 v6-enmr su: INFO: detected array index '0'<br>Feb 27 12:33:26 v6-enmr su: MCfgProcessLine(RMHOST,0,<a href="http://v6-enmr.cerm.unifi.it">v6-enmr.cerm.unifi.it</a>)<br>
Feb 27 12:33:26 v6-enmr su(pam_unix)[30131]: session closed for user maui<br>Feb 27 12:33:26 v6-enmr su: INFO: detected array index '0'<br>Feb 27 12:33:26 v6-enmr su: MCfgProcessLine(RMTYPE,0,PBS)<br>Feb 27 12:33:26 v6-enmr su: MUGetIndex(PBS,ValList,0)<br>
Feb 27 12:33:26 v6-enmr su: MCfgProcessLine(SERVERHOST,,<a href="http://v6-enmr.cerm.unifi.it">v6-enmr.cerm.unifi.it</a>)<br>Feb 27 12:33:26 v6-enmr su: MCfgSetVal(SERVERHOST,IVal,DVal,SVal,SArray,P)<br>Feb 27 12:33:26 v6-enmr su: INFO: starting scheduler on '<a href="http://v6-enmr.cerm.unifi.it">v6-enmr.cerm.unifi.it</a>'<br>
Feb 27 12:33:26 v6-enmr su: MCfgProcessLine(SERVERMODE,,NORMAL)<br>Feb 27 12:33:26 v6-enmr su: MCfgSetVal(SERVERMODE,IVal,DVal,SVal,SArray,P)<br>Feb 27 12:33:26 v6-enmr su: MUGetIndex(NORMAL,ValList,1)<br>Feb 27 12:33:26 v6-enmr su: MCfgProcessLine(SERVERPORT,,40559)<br>
Feb 27 12:33:26 v6-enmr su: MCfgSetVal(SERVERPORT,IVal,DVal,SVal,SArray,P)<br>Feb 27 12:33:26 v6-enmr su: MAMSetDefaults()<br>Feb 27 12:33:26 v6-enmr su: ServerProcessArgs(1,ArgV,0)<br>Feb 27 12:33:26 v6-enmr su: MUGetOpt(1,ArgV,a:Ab:B:c:C:dD:f:hH:i:j:l:L:m:n:N:p:P:r:s:v?-:,OptArg)<br>
Feb 27 12:33:26 v6-enmr su: ServerDemonize()<br>Feb 27 12:33:26 v6-enmr su: INFO: child process in background<br>Feb 27 12:33:26 v6-enmr su: ServerAuthenticate()<br>Feb 27 12:33:26 v6-enmr su: MFULock(/var/spool/maui/,/var/spool/maui/maui.pid)<br>
Feb 27 12:33:26 v6-enmr su: INFO: executing scheduler from '/var/spool/maui/' under UID 7721 GID 7721<br>Feb 27 12:33:26 v6-enmr su: SDRGetSystemConfig()<br>Feb 27 12:33:26 v6-enmr su: MSysStartServer()<br>Feb 27 12:33:26 v6-enmr su: starting 3.2.6p20 version Maui (PID: 30132) on Wed Feb 27 12:33:25<br>
Feb 27 12:33:26 v6-enmr su: MSysMemCheck()<br>Feb 27 12:33:26 v6-enmr su: MNode[5120] 0.02<br>Feb 27 12:33:26 v6-enmr su: MJob[32768] 0.12<br>Feb 27 12:33:26 v6-enmr su: MJobTraceBuffer[32768] 0.00<br>
Feb 27 12:33:26 v6-enmr su: MUser[1792] 0.01<br>Feb 27 12:33:26 v6-enmr su: MGroup[1792] 2.06<br>Feb 27 12:33:26 v6-enmr su: MAcct[1792] 2.06<br>Feb 27 12:33:27 v6-enmr su: MRes[8192] 0.03<br>
Feb 27 12:33:27 v6-enmr su: SRes[ 128] 2.39<br>Feb 27 12:33:27 v6-enmr su: MStatInitialize(P)<br>Feb 27 12:33:27 v6-enmr su: MStatProfInitialize(P)<br>Feb 27 12:33:27 v6-enmr su: MStatOpenFile(1204112005)<br>
Feb 27 12:33:27 v6-enmr su: WARNING: cannot open statfile '/var/spool/maui/stats/Wed_Feb_27_2008', errno: 13 (Permission denied)<br>Feb 27 12:33:27 v6-enmr su: VERSION 230<br>Feb 27 12:33:27 v6-enmr su: MSUListen(S)<br>
Feb 27 12:33:27 v6-enmr su: INFO: opened service socket on port 40559<br>Feb 27 12:33:27 v6-enmr su: MSUListen(S)<br>Feb 27 12:33:27 v6-enmr su: INFO: opened service socket on port 40560<br>Feb 27 12:33:27 v6-enmr su: MFSInitialize()<br>
Feb 27 12:33:27 v6-enmr su: MCPLoad(/var/spool/maui/maui.ck,ResOnly)<br>Feb 27 12:33:27 v6-enmr su: MRMInitialize()<br>Feb 27 12:33:27 v6-enmr su: MPBSInitialize(0,SC)<br>Feb 27 12:33:27 v6-enmr su: INFO: parent is exiting<br>
Feb 27 12:33:27 v6-enmr pbsserver: su startup succeeded<br><br><br>I don't understand the "Connection refused" error in this log (pbs_server could not contact the scheduler on port 15004).<br><br>The output of checknode looks correct:<br><br>checking node <a href="http://wn5-enmr.cerm.unifi.it">wn5-enmr.cerm.unifi.it</a><br>
<br>State: Idle (in current state for 00:01:06)<br>Configured Resources: PROCS: 1 MEM: 1024M SWAP: 3004M DISK: 1M<br>Utilized Resources: [NONE]<br>Dedicated Resources: [NONE]<br>Opsys: linux Arch: [NONE]<br>
Speed: 1.00 Load: 0.000<br>Network: [DEFAULT]<br>Features: [NONE]<br>Attributes: [Batch]<br>Classes: [batch 1:1]<br><br>Total Time: 00:01:15 Up: 00:01:04 (85.33%) Active: 00:00:00 (0.00%)<br><br>Reservations:<br>
NOTE: no reservations on node<br><br><br><br>The output of showq also looks correct:<br><br>ACTIVE JOBS--------------------<br>JOBNAME USERNAME STATE PROC REMAINING STARTTIME<br><br><br> 0 Active Jobs 0 of 1 Processors Active (0.00%)<br>
<br>IDLE JOBS----------------------<br>JOBNAME USERNAME STATE PROC WCLIMIT QUEUETIME<br><br><br>0 Idle Jobs<br><br>BLOCKED JOBS----------------<br>JOBNAME USERNAME STATE PROC WCLIMIT QUEUETIME<br>
<br><br>Total Jobs: 0 Active Jobs: 0 Idle Jobs: 0 Blocked Jobs: 0<br><br><br>However, when I try to submit a job (qsub pbsrun -q batch), I get:<br>qsub: Unknown queue MSG=cannot save creds<br><br>and pbs_server dies without logging any message.<br>
<br>qmgr prints the following configuration:<br>Qmgr: p s<br>#<br># Create queues and set their attributes.<br>#<br>#<br># Create and define queue batch<br>#<br>create queue batch<br>set queue batch queue_type = Execution<br>set queue batch resources_default.nodes = 1<br>
set queue batch resources_default.walltime = 01:00:00<br>set queue batch acl_groups = users<br>set queue batch enabled = True<br>set queue batch started = True<br>#<br># Set server attributes.<br>#<br>set server scheduling = True<br>
set server acl_hosts = <a href="http://wn5-enmr.cerm.unifi.it">wn5-enmr.cerm.unifi.it</a><br>set server acl_hosts += <a href="http://v6-enmr.cerm.unifi.it">v6-enmr.cerm.unifi.it</a><br>set server managers = <a href="mailto:afsadm/admin@CERM.UNIFI.IT">afsadm/admin@CERM.UNIFI.IT</a><br>
set server operators = <a href="mailto:afsadm/admin@CERM.UNIFI.IT">afsadm/admin@CERM.UNIFI.IT</a><br>set server default_queue = batch<br>set server log_events = 511<br>set server mail_from = adm<br>set server scheduler_iteration = 600<br>
set server node_check_rate = 150<br>set server tcp_timeout = 6<br>set server mom_job_sync = True<br>set server keep_completed = 300<br>set server next_job_number = 7<br><br> <br>Does anyone have any idea?<br><br>Thanks a lot,<br>
Enrico<br><br><br>