[padb] [padb-devel] Simple Makefile patch

Ethan Mallove ethan.mallove at sun.com
Thu Nov 5 21:36:45 GMT 2009


On Thu, Nov/05/2009 09:28:07PM, Ashley Pittman wrote:
> On Thu, 2009-11-05 at 13:23 -0500, Ethan Mallove wrote:
> > On Thu, Nov/05/2009 04:53:49PM, Ashley Pittman wrote: 
> > > I had a thought about this last night: it could be that the scanf
> > > qualifier %p isn't working. Googling around for some docs suggests that
> > > %p won't work. Can you try the attached patch to see if that fixes the
> > > issue?  Failing that, we might have to wait until I can next get my hands
> > > on a Sun machine.
> 
> > gcc minfo.c -o minfo.x -ldl -Wall -g
> > minfo.c: In function 'find_sym':
> > minfo.c:146: warning: format '%lx' expects type 'long unsigned int *', but argument 3 has type 'void **'
> > minfo.c:148: warning: format '%lx' expects type 'long unsigned int', but argument 3 has type 'void *'
> > minfo.c: In function '_find_data':
> > minfo.c:306: warning: format '%lx' expects type 'long unsigned int', but argument 3 has type 'void *'
> > minfo.c: In function 'fetch_string':
> > minfo.c:362: warning: format '%lx' expects type 'long unsigned int', but argument 4 has type 'void *'
> > minfo.c: In function 'show_op':
> > minfo.c:468: warning: format '%lx' expects type 'long unsigned int', but argument 3 has type 'void *'
> 
> Those warnings are OK, I get them on Linux as well.  If it works I'll add
> casts to quieten them down.
> 
> I've booted a Solaris machine here and it seems I was right about this:
> using %p in sscanf doesn't work on Solaris, so the patch should work or
> at least be a step in the right direction.  Oddly enough, sscanf returns
> 1 in this case, which would seem to indicate it had matched. That code
> already has a test for the return code of sscanf here but it's not
> triggering :(
> 
> Could you run with the patch applied and send me the output and the log
> file generated, please?  You can just run "padb -aqr0" to show the
> message queues of rank zero rather than --full-command, which will
> attempt to show you everything about the job.

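For reference, here is a minimal sketch of the approach discussed above: read the address with %lx into an integer and cast it to a pointer, instead of relying on %p in sscanf, and cast the pointer back when printing so the -Wall format warnings go away. The helper names parse_addr and format_addr are hypothetical and are not taken from minfo.c. The sketch also assumes that unsigned long is at least as wide as a pointer, which holds on the usual Solaris and Linux ABIs but not everywhere.

```c
#include <stdint.h>
#include <stdio.h>

/* Hypothetical helper (not from minfo.c): parse a hex address such as
 * "0xff1f9824" without using the scanf "%p" conversion.
 * Returns 1 on success and 0 on failure. */
static int parse_addr(const char *text, void **out)
{
    unsigned long value = 0;

    /* "%lx" accepts an optional "0x" prefix, like strtoul() with base 16. */
    if (sscanf(text, "%lx", &value) != 1)
        return 0;

    /* Assumption: unsigned long is wide enough to hold a pointer value. */
    *out = (void *)(uintptr_t)value;
    return 1;
}

/* Hypothetical helper: print a pointer in a fixed "0x..." hex form.
 * The explicit cast matches the "%lx" format, so gcc -Wall stays quiet. */
static int format_addr(char *buf, size_t len, const void *ptr)
{
    return snprintf(buf, len, "0x%lx", (unsigned long)(uintptr_t)ptr);
}
```

Writing the "0x" prefix explicitly on output keeps the text form identical across platforms, since what %p prints is implementation-defined.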

$ padb -aqr0 --config-option rmgr=mpirun --full-report=15961
padb version 3.n (Revision 312)
full job report for job 15961

Warning: errors reported by some ranks
========
[0]: Error message from /home/em162155/software/SunOS/sparc/padb/bin/minfo.x: Could not load symbols from dll
========
Warning: errors reported by some ranks
========
[0]: Error message from /home/em162155/software/SunOS/sparc/padb/bin/minfo.x: Could not load symbols from dll
========
Total: 0 communicators, no communication data recorded.
-----------------
[0] (1 processes)
-----------------
main() at hello_c.c:18
      params
        int     argc = '1' [0]
        char ** argv = '0xffbfe304' [0]
      locals
        int rank = '0' [0]
        int size = '-4201424' [0]
  -----------------
  [0] (1 processes)
  -----------------
  sleep() at ?:?
    ___nanosleep() at ?:?


$ cat /tmp/padb-minfo-debug-log-0-oFPPDR
req: sym MPIR_dll_name
ok 0xff1f9824
req: string 1024 0xff1f9824
ok /ws/hpc-ct-1/hpc-ct-8.2.1/pkgs/09d/SunOS-10/sparc/built-with-sun/installs/r0Y7/install/lib/openmpi/libompi_dbg_msgq.so
zzz: str:31 dmsg
Could not load symbols from dll
zzz: str:3 exit
die

$ cat /tmp/padb-minfo-debug-log-0-ZA3VBw
req: sym MPIR_dll_name
ok 0xff1f9824
req: string 1024 0xff1f9824
ok /ws/hpc-ct-1/hpc-ct-8.2.1/pkgs/09d/SunOS-10/sparc/built-with-sun/installs/r0Y7/install/lib/openmpi/libompi_dbg_msgq.so
zzz: str:31 dmsg
Could not load symbols from dll
zzz: str:3 exit
die

-Ethan

> 
> Ashley.
> 
> 
> -- 
> 
> Ashley Pittman, Bath, UK.
> 
> Padb - A parallel job inspection tool for cluster computing
> http://padb.pittman.org.uk
> 