[padb] Réf. : Re: Réf. : Re: Réf. : Réf. : Réf. : Re: Réf. : Re: Réf. : Bullchanges ( with LSF -mpich2 wrapper and -openmpi_wrapper combined)

Ashley Pittman ashley at pittman.co.uk
Wed Feb 17 08:59:44 GMT 2010


On 16 Feb 2010, at 23:29, thipadin.seng-long at bull.net wrote:

> 
> On 16 Feb 2010, at 23:16 Ashley Pittman <ashley at pittman.co.uk> wrote: 
> 
> I was testing further and there's still another problem, i guess it came from the ps command you changed. 
> 
> [senglont at artemis1 lsf-ompi]$ ./padb -O rmgr=lsf -tx 1516 
> Use of uninitialized value in numeric eq (==) at ./padb line 2896. 
> Use of uninitialized value in numeric eq (==) at ./padb line 2896. 
> Use of uninitialized value in numeric eq (==) at ./padb line 2896. 
> Use of uninitialized value in numeric eq (==) at ./padb line 2896. 
> 
> Here's the result of the break point after the call to slurp_cmd: 

Can you try this patch, I'd forgotten how to format ps commands.  The following should work.

Index: padb
===================================================================
--- padb        (revision 391)
+++ padb        (working copy)
@@ -2937,7 +2937,7 @@
 
     #get ps from the leading host(the one that start mpirun.lsf)
     my @handle =
-      slurp_remote_cmd( $host, "ps -o pid=,ppid=,cmd= -u $target_user" );
+      slurp_remote_cmd( $host, "ps -o pid= -o ppid= -o cmd= -u $target_user" );
 
     $count_line = @handle;
     for ( my $i = 0 ; $i < $count_line ; $i++ ) {    # to avoid loop

-- 

Ashley Pittman, Bath, UK.

Padb - A parallel job inspection tool for cluster computing
http://padb.pittman.org.uk





More information about the padb-devel mailing list