Hi all<br>I ve installed torque 2.4.6 and enabled the blcr with the following option while installing<br><br><b>./configure --disable-gui --with-server-home=/var/spool/PBS --with-default-server=gcluster.grid --enable-unixsockets=no --enable-blcr --disable-gcc-warnings<br>
<br></b>Also my mom_priv/config looks like<br><br>/var/spool/PBS/mom_priv/config<br>$checkpoint_script /var/spool/PBS/mom_priv/blcr_checkpoint_script<br>$restart_script /var/spool/PBS/mom_priv/blcr_restart_script<br>$checkpoint_run_exe /usr/local/bin/cr_run<br>
$pbsserver gcluster.grid<br>$loglevel 7<br><br><br>I ve created blcr_checkpoint_script & blcr_restart_script scripts too<br><br>While job submission Im getting the following error .. Please help me to solve this error.. Is there any thing else to be configured for this??<br>
<br>[guser02@gcluster ~]$ qsub -c enabled,periodic,shutdown,interval=1 test.sh<br>1.gcluster.grid<br><br>[guser02@gcluster ~]$ qhold 1<br><br>[guser02@gcluster ~]$ qstat<br>Job id Name User Time Use S Queue<br>
------------------------- ---------------- --------------- -------- - -----<br>1.gcluster test.sh guser02 0 R workq<br><br>[guser02@gcluster ~]$ qstat -f<br>Job Id: 1.gcluster.grid<br>
Job_Name = test.sh<br> Job_Owner = guser02@gcluster.grid<br> job_state = R<br> queue = workq<br> server = gcluster.grid<br> Checkpoint = enabled,periodic,shutdown,interval=1<br> ctime = Fri Mar 26 17:20:03 2010<br>
Error_Path = gcluster.grid:/home/guser02/test.sh.e1<br> exec_host = gcluster.grid/0<br> Hold_Types = n<br> Join_Path = n<br> Keep_Files = n<br> Mail_Points = a<br> mtime = Fri Mar 26 17:20:05 2010<br>
Output_Path = gcluster.grid:/home/guser02/test.sh.o1<br> Priority = 0<br> qtime = Fri Mar 26 17:20:03 2010<br> Rerunable = True<br> Resource_List.nodect = 1<br> Resource_List.nodes = 1<br> session_id = 19993<br>
Variable_List = PBS_O_HOME=/home/guser02,PBS_O_LOGNAME=guser02,<br> PBS_O_PATH=/usr/local/firefox/:/opt/mpich-1.2.6/bin:/usr/local/jdk1.5<br> .0_03/bin/:/usr/local/bin:/bin:/usr/bin:/usr/X11R6/bin:/bin:/usr/local<br>
/tomcat-5.0.27/bin:/usr/local/ant-1.6.4/bin:/usr/local/globus-4.0.3/bi<br> n:/usr/local/globus-4.0.3/sbin:/bin:/usr/local/maui/bin:/usr/local/gw/<br> bin:/usr/local/rrdtool/bin:/opt/ganglia/bin:/usr/local/sbin:/usr/local<br>
/bin:/usr/local/pdftk-1.41/pdftk:/home/guser02/bin,<br> PBS_O_MAIL=/var/spool/mail/guser02,PBS_O_SHELL=/bin/bash,<br> PBS_O_HOST=gcluster.grid,PBS_SERVER=gcluster.grid,<br> PBS_O_WORKDIR=/home/guser02,PBS_O_QUEUE=workq<br>
comment = Scalar found where operator expected at /var/spool/PBS/mom_priv/<br> blcr_checkpoint_script line 31,<br> near "$signalNum $depth"<br> (Missing operator before $depth?)<br>syntax e<br>
rror at /var/spool/PBS/mom_priv/blcr_checkpoint_script line 31,<br> near "$signalNum $depth"<br>Global symbol "$depth" requires explicit pa<br> ckage name at /var/spool/PBS/mom_priv/blcr_checkpoint_script line 31.<br>
<br> Execution of /var/spool/PBS/mom_priv/blcr_checkpoint_script aborted du<br> e to compilation errors.<br><br> etime = Fri Mar 26 17:20:03 2010<br> submit_args = -c enabled,periodic,shutdown,interval=1 test.sh<br>
start_time = Fri Mar 26 17:20:03 2010<br> start_count = 1<br> fault_tolerant = False<br><br><br>Regards<br>Rajiv R<br>Project Associate,<br>CARE,MIT,<br>Anna university ,Chennai<br>