[Beowulf] Exascale by the end of the year?

Christopher Samuel samuel at unimelb.edu.au
Wed Mar 5 18:49:39 PST 2014


On 06/03/14 03:07, Joe Landman wrote:

> I've not done much with MPI in a few years, have they extended it
> beyond MPI_Init yet?  Can MPI procs just join a "borgified"
> collective, preserve state so restarts/moves/reschedules of ranks
> are cheap?  If not, what is the replacement for MPI that will do
> this?

Oops, forgot this in my previous email - I stumbled across the
University of Tennessee's ULFM (User Level Failure Mitigation) project,
which has a WordPress blog here:

http://fault-tolerance.org/

The site has a PDF of a two-page flyer from SC13 which gives an
overview:

http://fault-tolerance.org/wp-content/uploads/2013/12/SC13-ULFM.pdf

It describes the project thus:

# User Level Failure Mitigation is a set of MPI interface extensions
# enabling Message Passing programs to restore MPI communication
# capabilities affected by process failures. It supports rebuilding
# communicators, RMA windows and I/O Files
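
As a rough illustration of what "rebuilding communicators" means, here
is a toy model in plain Python. It is not the ULFM API and does not use
MPI at all: ToyComm, rank_of and shrink are made-up names, and the
sketch assumes that survivors keep their relative order when the
communicator is rebuilt.

```python
from dataclasses import dataclass, field


@dataclass
class ToyComm:
    """Toy stand-in for a communicator (hypothetical, not an MPI type)."""
    members: list                                # list position is the rank
    failed: set = field(default_factory=set)     # processes known to be dead

    def rank_of(self, pid):
        """Rank of a process within this communicator."""
        return self.members.index(pid)

    def shrink(self):
        """Return a new communicator holding only the survivors.

        Assumption of this sketch: survivors keep their relative order,
        so ranks are simply renumbered densely from 0.
        """
        return ToyComm([p for p in self.members if p not in self.failed])


world = ToyComm(members=["p0", "p1", "p2", "p3"])
world.failed.add("p1")                  # pretend p1 has died
repaired = world.shrink()

print(repaired.members)                 # survivors only
print(repaired.rank_of("p2"))           # p2 was rank 2 in world
```

The point is only that the surviving processes end up with a fresh,
dense set of ranks and can carry on communicating; how the application
recovers the lost process's state is a separate problem.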

All the best,
Chris
-- 
 Christopher Samuel        Senior Systems Administrator
 VLSCI - Victorian Life Sciences Computation Initiative
 Email: samuel at unimelb.edu.au Phone: +61 (0)3 903 55545
 http://www.vlsci.org.au/      http://twitter.com/vlsci

