new tool in PandaRoot: mpiTools [message #6142] |
Sat, 22 March 2008 13:52 |
Johan Messchendorp
Messages: 693 Registered: April 2007 Location: University of Groningen
|
first-grade participant |
From: *xs4all.nl
|
|
Dear all,
Some time ago, I added a new package in the toolbox of PandaRoot (/PndTools) called "mpiTools". This is a small program exploiting MPI (message passing interface) which allows an "easy" and a parallel distribution of jobs on computing clusters such as PBS/LSF or any set of computers which can communicate via ssh or rsh. It can, furthermore, run on shared-memory systems, such as multi-core machines. This code is based on a boss-workers model. More detailed information can be found in a README file in the corresponding directory. Here you will also find example scripts and macros.
Kind wishes and happy Easter,
Johan
Johan Messchendorp
University of Groningen/KVI
Zernikelaan 25
NL-9747 AA Groningen
The Netherlands
tel. +31-503633558
fax +31-503634003
[Updated on: Sun, 21 December 2008 12:31] Report message to a moderator
|
|
|
|
|
Re: new tool in PandaRoot: mpiTools [message #7654 is a reply to message #6142] |
Sun, 21 December 2008 12:28 |
Johan Messchendorp
Messages: 693 Registered: April 2007 Location: University of Groningen
|
first-grade participant |
From: *xs4all.nl
|
|
Hi,
I added a few new features (downwards compatible, though) in mpiTools, namely
* more options can be set "on-the-fly" via the job description file, such as timeouts, scratch path, minimum required disk space, maximum allowed threads per worker, etc. With this, one does not need to specify these options on the start-up of the code itself.
* it is possible to specify different TIMEOUTS for the time period a job is being killed and the sleep period in case a worker suffers from insufficient disk space or too many thread already running. Before, only one TIMEOUT could be specified.
* An option "NICE <lvl>" can be specified in the job description file, which sets the priority level of the following JOBS, e.g. the following jobs will run with a nice level of <lvl> on the worker nodes. Note that -20 is the highest priority and +20, the lowest. Negative nice levels can only be set by the root user.
* An option "MAXLOAD <lvl> can be specified in the job description file, which defines the maximum load of a worker node. Exceeding this number will put the worker to sleep specified in TIMEOUTS.
* Added a wrapper script "runmpi" for an MPI session, which is in particularly interesting for heterogeneous systems
* Several optimizations, better error and log handling
* Inclusion of CPU time in log output
* PBS script modified to support MPI2
The wiki site http://panda-wiki.gsi.de/cgi-bin/view/Computing/PandaRootTools and the README file have been updated according to the changes. The updates have been carried out in the "trunk" branch of PandaRoot (rev4257 or higher).
Johan.
[Updated on: Wed, 31 December 2008 01:40] Report message to a moderator
|
|
|