[padb] Patch of support of Slurm + Openmpi Orte manager

Ashley Pittman ashley at pittman.co.uk
Wed Dec 2 17:48:56 GMT 2009


On Wed, 2009-12-02 at 15:51 +0000, Ashley Pittman wrote:
> 
> I'm wondering if it might be better to simply walk all processes in a
> very similar way to pbs_find_pids and check for OMPI_COMM_WORLD_RANK
> OMPI_COMM_WORLD_SIZE, SLURM_JOB_ID and SLURM_STEP_ID.  This code could
> then be used as a fallback in case scontol listpids failed to return
> any
> pids and hence wouldn't need any options twiddled to enable it.
> 
> Combined with some more intelligent setting of default values for
> slurm_job_step and that could make this case full automatic with the
> user just specifying the jobid and nothing else. 

The attached patch implements just that, "padb -a --proc-summary
-Ormgr=slurm" works for me correctly in all cases I've tested.

Let me know if this works for you and if you're happy with this
approach.

Ashley,

-- 

Ashley Pittman, Bath, UK.

Padb - A parallel job inspection tool for cluster computing
http://padb.pittman.org.uk
-------------- next part --------------
A non-text attachment was scrubbed...
Name: padb-slurm-open-3.patch
Type: text/x-patch
Size: 2716 bytes
Desc: not available
URL: <http://pittman.org.uk/pipermail/padb-devel_pittman.org.uk/attachments/20091202/5e2fdf17/attachment.bin>


More information about the padb-devel mailing list