<br><font size=2 face="Courier New">On 2009-11-18 at 06:10 PM, ashley@pittman.co.uk wrote:</font>
<br>
<br><font size=2 face="Courier New">> How does this work if you do say ./src/padb -axt? If it works in that<br>
> case then I'm happy with the code and I'll commit it, I've not added<br>
> anything before as I couldn't think of a generalised solution.</font>
<br>
<br><font size=2 face="Courier New">I now have a generalised solution (in pbs_setup_job).</font>
<br><font size=2 face="Courier New">I have tested it with the following invocations: ../padb</font>
<br><font size=2 face="Courier New"> DirTest/padb</font>
<br><font size=2 face="Courier New"> ./DirTest/padb</font>
<br><font size=2 face="Courier New"> ./padb</font>
<br><font size=2 face="Courier New"> /full_path/padb</font>
<br><font size=2 face="Courier New"> padb (/usr/bin/padb)</font>
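<br>
<br><font size=2 face="Courier New">For illustration only, here is a minimal sketch of one way to turn each of the invocation forms above into an absolute path. It is not the code from the patch: resolve_invocation is a hypothetical helper, and the PATH search for bare names is an assumption about how such a case could be handled.</font>

```perl
use strict;
use warnings;
use Cwd qw(abs_path);
use File::Spec;

# Hypothetical helper (not taken from the patch): map the string a script
# was invoked with to an absolute path.
sub resolve_invocation {
    my ( $invoked, $path_env ) = @_;

    # Anything containing a slash is either absolute or relative to the
    # current directory, so abs_path() can resolve it directly.
    if ( $invoked =~ m{/} ) {
        return abs_path($invoked);
    }

    # A bare name is looked up along PATH, the way a shell would do it.
    foreach my $dir ( split /:/, $path_env // '' ) {
        next if $dir eq '';
        my $candidate = File::Spec->catfile( $dir, $invoked );
        return abs_path($candidate) if -f $candidate && -x _;
    }

    # Nothing matched: undef in scalar context.
    return;
}

my $self = resolve_invocation( $0, $ENV{PATH} );
print defined $self ? "$self\n" : "could not resolve $0\n";
```

<br><font size=2 face="Courier New">With this shape, ../padb, DirTest/padb, ./padb and /full_path/padb all take the first branch, and only a bare padb falls through to the PATH search.</font>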
<br>
<br><font size=2 face="Courier New">>> 4077 foreach my $proc ( 0 .. $comm_data->{nprocesses} - 1 ) { <br>
<br>
> Are you able to extract the process count from the job id and return it<br>
> as "nprocesses" in the hash returned by pbs_setup_job()? I'm not<br>
> familiar with qstat so I don't know how to find this information.</font>
<br>
<br><font size=2 face="Courier New">OK, this is done in pbs_get_lqsub, when it parses the output of "qstat -w -n -u $user \@$server".</font>
<br><font size=2 face="Courier New">The process count is stored in $pbs_tabjobs{$job}{nproc} and is then used in pbs_setup_job.</font>
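<br>
<br><font size=2 face="Courier New">A rough sketch of the idea, not the code from the patch: the sample text below is made up to resemble qstat output, the server name xn0 and job name are invented, and treating the 7th field as the task count is an assumption that may not hold for every PBS version. parse_qstat is a hypothetical name; only %pbs_tabjobs and the nproc key come from the description above.</font>

```perl
use strict;
use warnings;

# Made-up sample in the general shape of "qstat -n -u" output. The real
# column layout can differ between PBS versions, so this is an assumption.
my $sample = <<'EOT';
Job ID          Username Queue    Jobname    SessID NDS TSK Memory Time  S Time
--------------- -------- -------- ---------- ------ --- --- ------ ----- - -----
27616.xn0       thipa    workq    sndrcv      12345   2   8    --  01:00 R 00:05
   xn5/0*4+xn6/0*4
EOT

# Hypothetical parser: returns a hash keyed by numeric job id.
sub parse_qstat {
    my ($text) = @_;
    my %jobs;
    foreach my $line ( split /\n/, $text ) {

        # Job lines start with "<number>.<server>"; header lines and the
        # indented node list do not, so they are skipped.
        next unless $line =~ /^(\d+)\.\S+\s/;
        my $job    = $1;
        my @fields = split ' ', $line;

        # Assumed: the 7th whitespace-separated field is the task count.
        next unless @fields >= 7 && $fields[6] =~ /^\d+$/;
        $jobs{$job}{nproc} = $fields[6];
    }
    return %jobs;
}

my %pbs_tabjobs = parse_qstat($sample);
print "job 27616: $pbs_tabjobs{27616}{nproc} processes\n";
```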
<br>
<br><font size=2 face="Courier New">The test output looks like this:</font>
<br>
<br><font size=2 face="Courier New">[thipa@xn5 padb_open]$ DirTest/padb -O rmgr=pbs -O stack-shows-locals=no -O stack-shows-params=no -tx 27616</font>
<br><font size=2 face="Courier New">Warning, failed to locate ranks [5,7]</font>
<br><font size=2 face="Courier New">-----------------</font>
<br><font size=2 face="Courier New">[0-1,4,6] (4 processes)</font>
<br><font size=2 face="Courier New">-----------------</font>
<br><font size=2 face="Courier New">main() at pp_sndrcv_spbl.c:50</font>
<br><font size=2 face="Courier New"> PMPI_Finalize() at ?:?</font>
<br><font size=2 face="Courier New"> MPID_Finalize() at ?:?</font>
<br><font size=2 face="Courier New"> MPIDI_CH3_Progress_wait() at ?:?</font>
<br><font size=2 face="Courier New"> MPIDU_Sock_wait() at ?:?</font>
<br><font size=2 face="Courier New"> poll() at ?:?</font>
<br><font size=2 face="Courier New">-----------------</font>
<br><font size=2 face="Courier New">[2-3] (2 processes)</font>
<br><font size=2 face="Courier New">-----------------</font>
<br><font size=2 face="Courier New">ThreadId: 1</font>
<br><font size=2 face="Courier New"> -----------------</font>
<br><font size=2 face="Courier New"> [2] (1 processes)</font>
<br><font size=2 face="Courier New"> -----------------</font>
<br><font size=2 face="Courier New"> main() at pp_sndrcv_spbl.c:46</font>
<br><font size=2 face="Courier New"> PMPI_Recv() at ?:?</font>
<br><font size=2 face="Courier New"> MPID_Progress_wait() at ?:?</font>
<br><font size=2 face="Courier New"> MPIDI_CH3_Progress_wait() at ?:?</font>
<br><font size=2 face="Courier New"> MPIDU_Sock_wait() at ?:?</font>
<br><font size=2 face="Courier New"> </font>
<br><font size=2 face="Courier New"> ThreadId: 2</font>
<br><font size=2 face="Courier New"> start_thread() at ?:?</font>
<br><font size=2 face="Courier New"> fd_server() at server.c:354</font>
<br><font size=2 face="Courier New"> select() at ?:?</font>
<br><font size=2 face="Courier New"> -----------------</font>
<br><font size=2 face="Courier New"> [3] (1 processes)</font>
<br><font size=2 face="Courier New"> -----------------</font>
<br><font size=2 face="Courier New"> main() at pp_sndrcv_spbl.c:50</font>
<br><font size=2 face="Courier New"> PMPI_Finalize() at ?:?</font>
<br><font size=2 face="Courier New"> MPID_Finalize() at ?:?</font>
<br><font size=2 face="Courier New"> MPIDI_CH3_Progress_wait() at ?:?</font>
<br><font size=2 face="Courier New"> MPIDU_Sock_wait() at ?:?</font>
<br><font size=2 face="Courier New"> poll() at ?:?</font>
<br><font size=2 face="Courier New"> ThreadId: 2</font>
<br><font size=2 face="Courier New"> start_thread() at ?:?</font>
<br><font size=2 face="Courier New"> fd_server() at server.c:354</font>
<br><font size=2 face="Courier New"> select() at ?:?</font>
<br><font size=2 face="Courier New">[thipa@xn5 padb_open]$ </font>
<br>
<br><font size=2 face="Courier New">Remark: the message "Warning, failed to locate ranks [5,7]" is normal here, because those processes had already terminated.</font>
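<br>
<br><font size=2 face="Courier New">The group headers in the output, such as [0-1,4,6], use a compact range notation for sets of ranks. As a minimal sketch of how such a label can be built (compress_ranks is a hypothetical helper, not padb's own routine):</font>

```perl
use strict;
use warnings;

# Hypothetical helper: collapse a list of ranks into "[a-b,c,...]" notation.
sub compress_ranks {
    my @ranks = sort { $a <=> $b } @_;
    return '[]' unless @ranks;

    my @parts;
    my ( $start, $prev ) = ( $ranks[0], $ranks[0] );
    foreach my $r ( @ranks[ 1 .. $#ranks ] ) {
        next if $r == $prev;    # ignore duplicate ranks

        # Extend the current run while ranks stay consecutive.
        if ( $r == $prev + 1 ) {
            $prev = $r;
            next;
        }

        # The run ended: emit it and start a new one at $r.
        push @parts, $start == $prev ? $start : "$start-$prev";
        ( $start, $prev ) = ( $r, $r );
    }
    push @parts, $start == $prev ? $start : "$start-$prev";

    return '[' . join( ',', @parts ) . ']';
}

print compress_ranks( 0, 1, 4, 6 ), "\n";    # [0-1,4,6]
print compress_ranks( 2, 3 ), "\n";          # [2-3]
print compress_ranks( 5, 7 ), "\n";          # [5,7]
```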
<br>
<br><font size=2 face="Courier New">> I'll be able to take a closer look when I'm back from SC, I only have my<br>
> netbook with me and aren't able to test anything from here, the patch<br>
> looks good so far however.</font>
<br>
<br><font size=2 face="Courier New">Here is the new patch, generated with diff -u.</font>
<br>
<br>
<br>
<br><font size=2 face="Courier New">I hope you will be able to look at it soon.</font>
<br><font size=2 face="Courier New">I believe there are no remaining problems and that it now works.</font>
<br>
<br>
<br><font size=2 face="Courier New">Regards.</font>
<br><font size=2 face="Courier New">Thipadin.</font>
<br>