<br><h3 class="gD" style="color: rgb(121, 6, 25);"><span>Hi Jazcek Braden</span></h3><br>Hi, I've modified the <b>blcr_checkpoint_script</b> and removed the "depth" variable, then resubmitted the test job, but I still can't hold the job: it remains in the running state. The steps I followed are below. Please help me solve this issue.<br>
<br>[guser02@gcluster ~]$ qsub -c enabled,periodic,shutdown,interval=1 test.sh<br><br>[guser02@gcluster ~]$ qhold 8<br><br>[guser02@gcluster ~]$ qstat<br>Job id Name User Time Use S Queue<br>
------------------------- ---------------- --------------- -------- - -----<br>8.gcluster test.sh guser02 0 R workq<br><br><br>[guser02@gcluster ~]$ qstat -f<br>Job Id: 8.gcluster.grid<br>
Job_Name = test.sh<br> Job_Owner = guser02@gcluster.grid<br> job_state = R<br> queue = workq<br> server = gcluster.grid<br> Checkpoint = enabled,periodic,shutdown,interval=1<br> ctime = Fri Mar 26 19:07:07 2010<br>
Error_Path = gcluster.grid:/home/guser02/test.sh.e8<br> exec_host = gcluster.grid/0<br> Hold_Types = n<br> Join_Path = n<br> Keep_Files = n<br> Mail_Points = a<br> mtime = Fri Mar 26 19:07:12 2010<br>
Output_Path = gcluster.grid:/home/guser02/test.sh.o8<br> Priority = 0<br> qtime = Fri Mar 26 19:07:07 2010<br> Rerunable = True<br> Resource_List.nodect = 1<br> Resource_List.nodes = 1<br> session_id = 20882<br>
Variable_List = PBS_O_HOME=/home/guser02,PBS_O_LOGNAME=guser02,<br> PBS_O_PATH=/usr/local/firefox/:/opt/mpich-1.2.6/bin:/usr/local/jdk1.5<br> .0_03/bin/:/usr/local/bin:/bin:/usr/bin:/usr/X11R6/bin:/bin:/usr/local<br>
/tomcat-5.0.27/bin:/usr/local/ant-1.6.4/bin:/usr/local/globus-4.0.3/bi<br> n:/usr/local/globus-4.0.3/sbin:/bin:/usr/local/maui/bin:/usr/local/gw/<br> bin:/usr/local/rrdtool/bin:/opt/ganglia/bin:/usr/local/sbin:/usr/local<br>
/bin:/usr/local/pdftk-1.41/pdftk:/home/guser02/bin,<br> PBS_O_MAIL=/var/spool/mail/guser02,PBS_O_SHELL=/bin/bash,<br> PBS_O_HOST=gcluster.grid,PBS_SERVER=gcluster.grid,<br> PBS_O_WORKDIR=/home/guser02,PBS_O_QUEUE=workq<br>
comment = Usage: /var/spool/PBS/mom_priv/blcr_checkpoint_script<br><br> etime = Fri Mar 26 19:07:07 2010<br> submit_args = -c enabled,periodic,shutdown,interval=1 test.sh<br> start_time = Fri Mar 26 19:07:07 2010<br>
start_count = 1<br> fault_tolerant = False<br><br><br><div class="gmail_quote">On Fri, Mar 26, 2010 at 5:15 PM, Jazcek Braden <span dir="ltr"><<a href="mailto:jazcek@gmail.com">jazcek@gmail.com</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">There is a typo in the script in the documentation: it uses a<br>
variable called $depth without declaring it in the "my" statement a few<br>
lines up.<br>
<br>
-- Jazcek<br>
<div><div></div><div class="h5"><br>
On Fri, Mar 26, 2010 at 6:59 AM, Rajiv Rajaian <<a href="mailto:rajiv.care@gmail.com">rajiv.care@gmail.com</a>> wrote:<br>
> Hi all,<br>
> I've installed TORQUE 2.4.6 and enabled BLCR with the following configure<br>
> options during installation:<br>
><br>
> ./configure --disable-gui --with-server-home=/var/spool/PBS<br>
> --with-default-server=gcluster.grid --enable-unixsockets=no --enable-blcr<br>
> --disable-gcc-warnings<br>
><br>
> My mom_priv/config looks like this:<br>
><br>
> /var/spool/PBS/mom_priv/config<br>
> $checkpoint_script /var/spool/PBS/mom_priv/blcr_checkpoint_script<br>
> $restart_script /var/spool/PBS/mom_priv/blcr_restart_script<br>
> $checkpoint_run_exe /usr/local/bin/cr_run<br>
> $pbsserver gcluster.grid<br>
> $loglevel 7<br>
><br>
><br>
> I've also created the blcr_checkpoint_script and blcr_restart_script scripts.<br>
><br>
> When I submit a job and try to hold it, I get the following error. Please help<br>
> me solve it. Is there anything else that needs to be configured?<br>
><br>
> [guser02@gcluster ~]$ qsub -c enabled,periodic,shutdown,interval=1 test.sh<br>
> 1.gcluster.grid<br>
><br>
> [guser02@gcluster ~]$ qhold 1<br>
><br>
> [guser02@gcluster ~]$ qstat<br>
> Job id Name User Time Use S Queue<br>
> ------------------------- ---------------- --------------- -------- - -----<br>
> 1.gcluster test.sh guser02 0 R workq<br>
><br>
> [guser02@gcluster ~]$ qstat -f<br>
> Job Id: 1.gcluster.grid<br>
> Job_Name = test.sh<br>
> Job_Owner = guser02@gcluster.grid<br>
> job_state = R<br>
> queue = workq<br>
> server = gcluster.grid<br>
> Checkpoint = enabled,periodic,shutdown,interval=1<br>
> ctime = Fri Mar 26 17:20:03 2010<br>
> Error_Path = gcluster.grid:/home/guser02/test.sh.e1<br>
> exec_host = gcluster.grid/0<br>
> Hold_Types = n<br>
> Join_Path = n<br>
> Keep_Files = n<br>
> Mail_Points = a<br>
> mtime = Fri Mar 26 17:20:05 2010<br>
> Output_Path = gcluster.grid:/home/guser02/test.sh.o1<br>
> Priority = 0<br>
> qtime = Fri Mar 26 17:20:03 2010<br>
> Rerunable = True<br>
> Resource_List.nodect = 1<br>
> Resource_List.nodes = 1<br>
> session_id = 19993<br>
> Variable_List = PBS_O_HOME=/home/guser02,PBS_O_LOGNAME=guser02,<br>
> PBS_O_PATH=/usr/local/firefox/:/opt/mpich-1.2.6/bin:/usr/local/jdk1.5<br>
> .0_03/bin/:/usr/local/bin:/bin:/usr/bin:/usr/X11R6/bin:/bin:/usr/local<br>
> /tomcat-5.0.27/bin:/usr/local/ant-1.6.4/bin:/usr/local/globus-4.0.3/bi<br>
> n:/usr/local/globus-4.0.3/sbin:/bin:/usr/local/maui/bin:/usr/local/gw/<br>
> bin:/usr/local/rrdtool/bin:/opt/ganglia/bin:/usr/local/sbin:/usr/local<br>
> /bin:/usr/local/pdftk-1.41/pdftk:/home/guser02/bin,<br>
> PBS_O_MAIL=/var/spool/mail/guser02,PBS_O_SHELL=/bin/bash,<br>
> PBS_O_HOST=gcluster.grid,PBS_SERVER=gcluster.grid,<br>
> PBS_O_WORKDIR=/home/guser02,PBS_O_QUEUE=workq<br>
> comment = Scalar found where operator expected at<br>
> /var/spool/PBS/mom_priv/blcr_checkpoint_script line 31,<br>
> near "$signalNum $depth"<br>
> (Missing operator before $depth?)<br>
> syntax error at /var/spool/PBS/mom_priv/blcr_checkpoint_script line 31,<br>
> near "$signalNum $depth"<br>
> Global symbol "$depth" requires explicit package name at<br>
> /var/spool/PBS/mom_priv/blcr_checkpoint_script line 31.<br>
> Execution of /var/spool/PBS/mom_priv/blcr_checkpoint_script aborted<br>
> due to compilation errors.<br>
><br>
> etime = Fri Mar 26 17:20:03 2010<br>
> submit_args = -c enabled,periodic,shutdown,interval=1 test.sh<br>
> start_time = Fri Mar 26 17:20:03 2010<br>
> start_count = 1<br>
> fault_tolerant = False<br>
><br>
><br>
> Regards<br>
> Rajiv R<br>
> Project Associate,<br>
> CARE,MIT,<br>
> Anna university ,Chennai<br>
><br>
</div></div>
<br>
<br>
<br>
--<br>
<font color="#888888">Jazcek Braden<br>
</font></blockquote></div><br>
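In the qstat -f output for job 8 above, job_state is still R and Hold_Types is still n after qhold, while the comment field now carries the checkpoint script's Usage text instead of the Perl compile error. That suggests, though it does not prove, that the script now compiles but is being called with arguments it does not expect. A small filter makes it easier to watch just those fields between attempts. This is a minimal sketch: qstat_field is a hypothetical helper name, it only reads "name = value" lines, and it does not join wrapped continuation lines.<br>

```shell
#!/bin/sh
# Sketch: print the value of one "name = value" field from `qstat -f` output
# read on stdin. qstat_field is a hypothetical helper, not a TORQUE command.
qstat_field() {
    awk -v key="$1" '
        {
            line = $0
            sub(/^[ \t]+/, "", line)          # strip leading indentation
            prefix = key " = "
            if (index(line, prefix) == 1) {   # line starts with the field name
                print substr(line, length(prefix) + 1)
                exit
            }
        }'
}

# Sample lines copied from the qstat -f output quoted in this thread.
sample='Job Id: 8.gcluster.grid
    Job_Name = test.sh
    job_state = R
    Hold_Types = n
    comment = Usage: /var/spool/PBS/mom_priv/blcr_checkpoint_script'

printf '%s\n' "$sample" | qstat_field job_state
printf '%s\n' "$sample" | qstat_field Hold_Types
printf '%s\n' "$sample" | qstat_field comment
```

Against a live server the input would come from the real command instead of the sample text, for example qstat -f 8 | qstat_field comment.<br>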