Proactive fault tolerance using preemptive migration C Engelmann, GR Vallee, T Naughton, SL Scott 2009 17th Euromicro International Conference on Parallel, Distributed and …, 2009 | 153 | 2009 |
A survey of MPI usage in the US exascale computing project DE Bernholdt, S Boehm, G Bosilca, M Gorentla Venkata, RE Grant, ... Concurrency and Computation: Practice and Experience 32 (3), e4851, 2020 | 132 | 2020 |
A framework for proactive fault tolerance G Vallee, K Charoenpornwattana, C Engelmann, A Tikotekar, ... 2008 Third International Conference on Availability, Reliability and …, 2008 | 94 | 2008 |
Towards an efficient single system image cluster operating system C Morin, P Gallard, R Lottiaux, G Vallée Future Generation Computer Systems 20 (4), 505-521, 2004 | 88 | 2004 |
System-level virtualization for high performance computing G Vallee, T Naughton, C Engelmann, H Ong, SL Scott 16th Euromicro Conference on Parallel, Distributed and Network-Based …, 2008 | 87 | 2008 |
OpenMosix, OpenSSI and Kerrighed: a comparative study R Lottiaux, P Gallard, G Vallée, C Morin, B Boissinot CCGrid 2005. IEEE International Symposium on Cluster Computing and the Grid …, 2005 | 82 | 2005 |
Kerrighed: a single system image cluster operating system for high performance computing C Morin, R Lottiaux, G Vallée, P Gallard, G Utard, R Badrinath, L Rilling Euro-Par 2003 Parallel Processing: 9th International Euro-Par Conference …, 2003 | 78 | 2003 |
Checkpoint/restart of virtual machines based on Xen G Vallee, T Naughton, H Ong, SL Scott Proceedings of the High Availability and Performace Computing Workshop …, 2006 | 66 | 2006 |
Kerrighed and data parallelism: Cluster computing on single system image operating systems C Morin, R Lottiaux, G Vallée, P Gallard, D Margery, JY Berthou, ... 2004 IEEE International Conference on Cluster Computing (IEEE Cat. No …, 2004 | 66 | 2004 |
Fault injection framework for system resilience evaluation: fake faults for finding future failures T Naughton, W Bland, G Vallee, C Engelmann, SL Scott Proceedings of the 2009 workshop on Resiliency in high performance, 23-28, 2009 | 51 | 2009 |
System management software for virtual environments G Vallée, T Naughton, SL Scott Proceedings of the 4th international conference on Computing frontiers, 153-160, 2007 | 50 | 2007 |
{LADS}: Optimizing data transfers using {Layout-Aware} data scheduling Y Kim, S Atchley, GR Vallée, GM Shipman 13th USENIX Conference on File and Storage Technologies (FAST 15), 67-80, 2015 | 49 | 2015 |
A log-scaling fault tolerant agreement algorithm for a fault tolerant MPI J Hursey, T Naughton, G Vallee, RL Graham Recent Advances in the Message Passing Interface: 18th European MPI Users …, 2011 | 43 | 2011 |
A case for single system image cluster operating systems: the kerrighed approach G Vallée, R Lottiaux, L Rilling, JY Berthou, ID Malhen, C Morin Parallel Processing Letters 13 (02), 95-122, 2003 | 41 | 2003 |
Evaluation of fault-tolerant policies using simulation A Tikotekar, G Vallée, T Naughton, SL Scott, C Leangsuksun 2007 IEEE International Conference on Cluster Computing, 303-311, 2007 | 40 | 2007 |
Tagit: an integrated indexing and search service for file systems H Sim, Y Kim, SS Vazhkudai, GR Vallée, SH Lim, AR Butt Proceedings of the International Conference for High Performance Computing …, 2017 | 39 | 2017 |
An analysis of hpc benchmarks in virtual machine environments A Tikotekar, G Vallée, T Naughton, H Ong, C Engelmann, SL Scott Euro-Par 2008 Workshops-Parallel Processing: VHPC 2008, UNICORE 2008, HPPC …, 2009 | 35 | 2009 |
Effects of virtualization on a scientific application running a hyperspectral radiative transfer code on virtual machines A Tikotekar, G Vallée, T Naughton, H Ong, C Engelmann, SL Scott, ... Proceedings of the 2nd workshop on System-level virtualization for high …, 2008 | 35 | 2008 |
Process migration based on gobelins distributed shared memory G Vallee, C Morin, R Lottiaux, J Berthou, ID Malen 2nd IEEE/ACM International Symposium on Cluster Computing and the Grid …, 2002 | 34 | 2002 |
A new approach to configurable dynamic scheduling in clusters based on single system image technologies G Vallée, C Morin, JY Berthou, L Rilling Proceedings International Parallel and Distributed Processing Symposium, 8 pp., 2003 | 31 | 2003 |