[padb-users] padb stalls with no output.

Rahul Nabar rpnabar at gmail.com
Fri Aug 20 19:11:57 BST 2010


I'm stumped, since I had a perfectly running padb just a day ago and
now on the same system it won't produce any output at all. I'm sure
it's me doing something terribly silly, but can't figure out what!

------------------------------------------------------------------------------------------------------------
[rpnabar at eu001 test]$ whoami
rpnabar
[rpnabar at eu001 test]$ NP=64;mpirun  -np $NP --host
eu001,eu002,eu003,eu004,eu005,eu006,eu007,eu008   -mca btl
openib,sm,self    /opt/src/mpitests/imb/src/IMB-MPI1 -npmin $NP
gather

[wait for test to stall]
[then in another shell window onto the same node]

[rpnabar at eu001 ~]$ whoami
rpnabar

[rpnabar at eu001 ~]$ /opt/sbin/bin/padb --all --stack-trace --tree
--config-option rmgr=orte -v
Loading config from "/etc/padb.conf"
Loading config from "/home/rpnabar/.padbrc"
Loading config from environment
Loading config from command line
Setting 'rmgr' to 'orte'
Active jobs (0) are
No active jobs could be found for user 'rpnabar'

[gather test is still stalled in the other window]
------------------------------------------------------------------------------------------------------------


Any ideas what could be going on here? What's even more confusing is
that ompi-ps produces  no output either (see below)! Have I broken my
mpi install somehow? But that wouldn't make sense since the actual mpi
tests are running file. Again, the symptoms are so bizarre that I
suspect I am the one doing something stupid. But can't figure out what
it is!!

------------------------------------------------------------------------------------------------------------
[rpnabar at eu001 ~]$ /opt/sbin/bin/padb --show-jobs --config-option rmgr=orte
[No output]

[rpnabar at eu001 ~]$ ompi-ps
[No output]

[rpnabar at eu001 ~]$ ompi-ps -v
[eu001:11486] orte_ps: Acquiring list of HNPs and setting contact info
into RML...
[eu001:11486] orte_ps: Gathering Information for HNP: [[14224,0],0]:5891
[No output]
------------------------------------------------------------------------------------------------------------

-- 
Rahul




More information about the padb-users mailing list