Name: mpistat
Owner: Wellcome Trust Sanger Institute - Human Genetics Informatics
Description: Efficiently lstat every file in a filesystem tree in parallel across multiple nodes using MPI.
Created: 2014-11-07 13:24:02.0
Updated: 2016-01-18 11:18:56.0
Pushed: 2016-09-19 15:05:17.0
Size: 70
Language: Python
GitHub Committers
User | Most Recent Commit | # Commits |
---|
Other Committers
User | Most Recent Commit | # Commits |
---|
Efficiently walk a tree in parallel across many nodes using MPI. Based on http://conferences.computer.org/sc/2012/papers/1000a015.pdf. Also see http://jlafon.io/parallel-file-treewalk.html.
There is a C version using libcircle (https://github.com/hpc/libcircle) which is the C implementation used by the authors of the original article.
The python version uses Guy Coates's interpretation (https://github.com/wtsi-ssg/pwalk) of the algorithm discussed in the paper.
There is no significant difference in the time taken by the C and python versions due to the time being dominated by the lstat syscall.