<br><h3 class="gD" style="color: rgb(121, 6, 25);"><span>Hi Jazcek Braden</span></h3><br>Hi  Ive modified that <b> blcr_checkpoint_script  </b>and removed that &quot;depth&quot; variable and again submitted that test job and I cant hold that jobs ..Job still remains in running state .. The steps ive done are as follows .. Pls help me to solve this issue<br>
<br>[guser02@gcluster ~]$ qsub -c enabled,periodic,shutdown,interval=1 test.sh<br><br>guser02@gcluster ~]$ qhold 8<br><br>[guser02@gcluster ~]$ qstat<br>Job id                    Name             User            Time Use S Queue<br>
------------------------- ---------------- --------------- -------- - -----<br>8.gcluster                test.sh          guser02                0 R workq<br><br><br>[guser02@gcluster ~]$ qstat -f<br>Job Id: 8.gcluster.grid<br>
    Job_Name = test.sh<br>    Job_Owner = guser02@gcluster.grid<br>    job_state = R<br>    queue = workq<br>    server = gcluster.grid<br>    Checkpoint = enabled,periodic,shutdown,interval=1<br>    ctime = Fri Mar 26 19:07:07 2010<br>
    Error_Path = gcluster.grid:/home/guser02/test.sh.e8<br>    exec_host = gcluster.grid/0<br>    Hold_Types = n<br>    Join_Path = n<br>    Keep_Files = n<br>    Mail_Points = a<br>    mtime = Fri Mar 26 19:07:12 2010<br>
    Output_Path = gcluster.grid:/home/guser02/test.sh.o8<br>    Priority = 0<br>    qtime = Fri Mar 26 19:07:07 2010<br>    Rerunable = True<br>    Resource_List.nodect = 1<br>    Resource_List.nodes = 1<br>    session_id = 20882<br>
    Variable_List = PBS_O_HOME=/home/guser02,PBS_O_LOGNAME=guser02,<br>        PBS_O_PATH=/usr/local/firefox/:/opt/mpich-1.2.6/bin:/usr/local/jdk1.5<br>        .0_03/bin/:/usr/local/bin:/bin:/usr/bin:/usr/X11R6/bin:/bin:/usr/local<br>
        /tomcat-5.0.27/bin:/usr/local/ant-1.6.4/bin:/usr/local/globus-4.0.3/bi<br>        n:/usr/local/globus-4.0.3/sbin:/bin:/usr/local/maui/bin:/usr/local/gw/<br>        bin:/usr/local/rrdtool/bin:/opt/ganglia/bin:/usr/local/sbin:/usr/local<br>
        /bin:/usr/local/pdftk-1.41/pdftk:/home/guser02/bin,<br>        PBS_O_MAIL=/var/spool/mail/guser02,PBS_O_SHELL=/bin/bash,<br>        PBS_O_HOST=gcluster.grid,PBS_SERVER=gcluster.grid,<br>        PBS_O_WORKDIR=/home/guser02,PBS_O_QUEUE=workq<br>
    comment = Usage: /var/spool/PBS/mom_priv/blcr_checkpoint_script<br><br>    etime = Fri Mar 26 19:07:07 2010<br>    submit_args = -c enabled,periodic,shutdown,interval=1 test.sh<br>    start_time = Fri Mar 26 19:07:07 2010<br>
    start_count = 1<br>    fault_tolerant = False<br><br><br><div class="gmail_quote">On Fri, Mar 26, 2010 at 5:15 PM, Jazcek Braden <span dir="ltr">&lt;<a href="mailto:jazcek@gmail.com">jazcek@gmail.com</a>&gt;</span> wrote:<br>
<blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">There is a typo in the script in the documentation, they try to use a<br>
variable called depth without defining it in the my statement a few<br>
lines up<br>
<br>
-- Jazcek<br>
<div><div></div><div class="h5"><br>
On Fri, Mar 26, 2010 at 6:59 AM, Rajiv Rajaian &lt;<a href="mailto:rajiv.care@gmail.com">rajiv.care@gmail.com</a>&gt; wrote:<br>
&gt; Hi all<br>
&gt; I ve installed torque 2.4.6 and enabled the blcr with the following option<br>
&gt; while installing<br>
&gt;<br>
&gt; ./configure --disable-gui --with-server-home=/var/spool/PBS<br>
&gt; --with-default-server=gcluster.grid --enable-unixsockets=no --enable-blcr<br>
&gt; --disable-gcc-warnings<br>
&gt;<br>
&gt; Also my mom_priv/config looks like<br>
&gt;<br>
&gt; /var/spool/PBS/mom_priv/config<br>
&gt; $checkpoint_script  /var/spool/PBS/mom_priv/blcr_checkpoint_script<br>
&gt; $restart_script  /var/spool/PBS/mom_priv/blcr_restart_script<br>
&gt; $checkpoint_run_exe /usr/local/bin/cr_run<br>
&gt; $pbsserver gcluster.grid<br>
&gt; $loglevel 7<br>
&gt;<br>
&gt;<br>
&gt; I ve created blcr_checkpoint_script &amp; blcr_restart_script scripts too<br>
&gt;<br>
&gt; While job submission Im getting the following error .. Please help me to<br>
&gt; solve this error.. Is there any thing else to be configured for this??<br>
&gt;<br>
&gt; [guser02@gcluster ~]$ qsub -c enabled,periodic,shutdown,interval=1 test.sh<br>
&gt; 1.gcluster.grid<br>
&gt;<br>
&gt; [guser02@gcluster ~]$ qhold 1<br>
&gt;<br>
&gt; [guser02@gcluster ~]$ qstat<br>
&gt; Job id                    Name             User            Time Use S Queue<br>
&gt; ------------------------- ---------------- --------------- -------- - -----<br>
&gt; 1.gcluster                test.sh          guser02                0 R workq<br>
&gt;<br>
&gt; [guser02@gcluster ~]$ qstat -f<br>
&gt; Job Id: 1.gcluster.grid<br>
&gt;     Job_Name = test.sh<br>
&gt;     Job_Owner = guser02@gcluster.grid<br>
&gt;     job_state = R<br>
&gt;     queue = workq<br>
&gt;     server = gcluster.grid<br>
&gt;     Checkpoint = enabled,periodic,shutdown,interval=1<br>
&gt;     ctime = Fri Mar 26 17:20:03 2010<br>
&gt;     Error_Path = gcluster.grid:/home/guser02/test.sh.e1<br>
&gt;     exec_host = gcluster.grid/0<br>
&gt;     Hold_Types = n<br>
&gt;     Join_Path = n<br>
&gt;     Keep_Files = n<br>
&gt;     Mail_Points = a<br>
&gt;     mtime = Fri Mar 26 17:20:05 2010<br>
&gt;     Output_Path = gcluster.grid:/home/guser02/test.sh.o1<br>
&gt;     Priority = 0<br>
&gt;     qtime = Fri Mar 26 17:20:03 2010<br>
&gt;     Rerunable = True<br>
&gt;     Resource_List.nodect = 1<br>
&gt;     Resource_List.nodes = 1<br>
&gt;     session_id = 19993<br>
&gt;     Variable_List = PBS_O_HOME=/home/guser02,PBS_O_LOGNAME=guser02,<br>
&gt;<br>
&gt; PBS_O_PATH=/usr/local/firefox/:/opt/mpich-1.2.6/bin:/usr/local/jdk1.5<br>
&gt;<br>
&gt; .0_03/bin/:/usr/local/bin:/bin:/usr/bin:/usr/X11R6/bin:/bin:/usr/local<br>
&gt;<br>
&gt; /tomcat-5.0.27/bin:/usr/local/ant-1.6.4/bin:/usr/local/globus-4.0.3/bi<br>
&gt;<br>
&gt; n:/usr/local/globus-4.0.3/sbin:/bin:/usr/local/maui/bin:/usr/local/gw/<br>
&gt;<br>
&gt; bin:/usr/local/rrdtool/bin:/opt/ganglia/bin:/usr/local/sbin:/usr/local<br>
&gt;         /bin:/usr/local/pdftk-1.41/pdftk:/home/guser02/bin,<br>
&gt;         PBS_O_MAIL=/var/spool/mail/guser02,PBS_O_SHELL=/bin/bash,<br>
&gt;         PBS_O_HOST=gcluster.grid,PBS_SERVER=gcluster.grid,<br>
&gt;         PBS_O_WORKDIR=/home/guser02,PBS_O_QUEUE=workq<br>
&gt;     comment = Scalar found where operator expected at<br>
&gt; /var/spool/PBS/mom_priv/<br>
&gt;         blcr_checkpoint_script line 31,<br>
&gt;          near &quot;$signalNum $depth&quot;<br>
&gt;         (Missing operator before $depth?)<br>
&gt; syntax e<br>
&gt;         rror at /var/spool/PBS/mom_priv/blcr_checkpoint_script line 31,<br>
&gt;          near &quot;$signalNum $depth&quot;<br>
&gt; Global symbol &quot;$depth&quot; requires explicit pa<br>
&gt;         ckage name at /var/spool/PBS/mom_priv/blcr_checkpoint_script line<br>
&gt; 31.<br>
&gt;<br>
&gt;         Execution of /var/spool/PBS/mom_priv/blcr_checkpoint_script aborted<br>
&gt; du<br>
&gt;         e to compilation errors.<br>
&gt;<br>
&gt;     etime = Fri Mar 26 17:20:03 2010<br>
&gt;     submit_args = -c enabled,periodic,shutdown,interval=1 test.sh<br>
&gt;     start_time = Fri Mar 26 17:20:03 2010<br>
&gt;     start_count = 1<br>
&gt;     fault_tolerant = False<br>
&gt;<br>
&gt;<br>
&gt; Regards<br>
&gt; Rajiv R<br>
&gt; Project Associate,<br>
&gt; CARE,MIT,<br>
&gt; Anna university ,Chennai<br>
&gt;<br>
</div></div>&gt; _______________________________________________<br>
&gt; torqueusers mailing list<br>
&gt; <a href="mailto:torqueusers@supercluster.org">torqueusers@supercluster.org</a><br>
&gt; <a href="http://www.supercluster.org/mailman/listinfo/torqueusers" target="_blank">http://www.supercluster.org/mailman/listinfo/torqueusers</a><br>
&gt;<br>
&gt;<br>
<br>
<br>
<br>
--<br>
<font color="#888888">Jazcek Braden<br>
</font></blockquote></div><br>