|
|
|
|
Venues (Conferences, Journals, ...)
|
|
|
GrowBag graphs for keyword ? (Num. hits/coverage)
Group by:
The graphs summarize 108 occurrences of 76 keywords
|
|
|
|
|
Results
Found 80 publication records. Showing 80 according to the selection in the facets
| Hits ?▲ |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
| 3 | Joshua Hursey, Timothy Mattox, Andrew Lumsdaine |
Interconnect agnostic checkpoint/restart in open MPI.  |
HPDC  |
2009 |
DBLP DOI BibTeX RDF |
checkpoint coordination protocol, fault tolerance, MPI, shared memory, rollback-recovery, infiniband, myrinet, high speed interconnect, checkpoint/restart |
| 3 | Oreste Villa, Sriram Krishnamoorthy, Jarek Nieplocha, David M. Brown Jr. |
Scalable transparent checkpoint-restart of global address space applications on virtual machines over infiniband.  |
Conf. Computing Frontiers  |
2009 |
DBLP DOI BibTeX RDF |
checkpoint-restart, virtual machines, infiniband, global address space |
| 2 | Xiangyong Ouyang, Raghunath Rajachandrasekar, Xavier Besseron, Hao Wang, Jian Huang, Dhabaleswar K. Panda |
CRFS: A Lightweight User-Level Filesystem for Generic Checkpoint/Restart.  |
ICPP  |
2011 |
DBLP DOI BibTeX RDF |
checkpoint-restart, userspace filesystem, write aggregation |
| 2 | Stephen L. Scott, Christian Engelmann, Geoffroy Vallée, Thomas Naughton, Anand Tikotekar, George Ostrouchov, Chokchai Leangsuksun, Nichamon Naksinehaboon, Raja Nassar, Mihaela Paun, Frank Mueller, Chao Wang, Arun Babu Nagarajan, Jyothish Varma |
A tunable holistic resiliency approach for high-performance computing systems.  |
PPOPP  |
2009 |
DBLP DOI BibTeX RDF |
preemptive migration, fault tolerance, high-performance computing, resilience, checkpoint/restart |
| 2 | Yudan Liu, Raja Nassar, Chokchai Leangsuksun, Nichamon Naksinehaboon, Mihaela Paun, Stephen L. Scott |
An optimal checkpoint/restart model for a large scale high performance computing system.  |
IPDPS  |
2008 |
DBLP DOI BibTeX RDF |
|
| 2 | Joshua Hursey, Jeffrey M. Squyres, Timothy Mattox, Andrew Lumsdaine |
The Design and Implementation of Checkpoint/Restart Process Fault Tolerance for Open MPI.  |
IPDPS  |
2007 |
DBLP DOI BibTeX RDF |
|
| 2 | R. Badrinath, R. Krishnakumar, R. K. Palanivel Rajan |
Virtualization aware job schedulers for checkpoint-restart.  |
ICPADS  |
2007 |
DBLP DOI BibTeX RDF |
|
| 2 | Yudan Liu, Raja Nassar, Chokchai Leangsuksun, Nichamon Naksinehaboon, Mihaela Paun, Stephen L. Scott |
A reliability-aware approach for an optimal checkpoint/restart model in HPC environments.  |
CLUSTER  |
2007 |
DBLP DOI BibTeX RDF |
|
| 2 | Qi Gao, Weikuan Yu, Wei Huang, Dhabaleswar K. Panda |
Application-Transparent Checkpoint/Restart for MPI Programs over InfiniBand.  |
ICPP  |
2006 |
DBLP DOI BibTeX RDF |
|
| 2 | José Carlos Sancho, Fabrizio Petrini, Kei Davis, Roberto Gioiosa, Song Jiang |
Current Practice and a Direction Forward in Checkpoint/Restart Implementations for Fault Tolerance.  |
IPDPS  |
2005 |
DBLP DOI BibTeX RDF |
|
| 2 | G. John Janakiraman, Jose Renato Santos, Dinesh Subhraveti, Yoshio Turner |
Cruz: Application-Transparent Distributed Checkpoint-Restart on Standard Operating Systems.  |
DSN  |
2005 |
DBLP DOI BibTeX RDF |
|
| 2 | Oren Laadan, Dan B. Phung, Jason Nieh |
Transparent Checkpoint-Restart of Distributed Applications on Commodity Clusters.  |
CLUSTER  |
2005 |
DBLP DOI BibTeX RDF |
|
| 2 | Yudan Liu, Chokchai Leangsuksun, Hertong Song, Stephen L. Scott |
Reliability-aware Checkpoint/Restart Scheme: A Performability Trade-off.  |
CLUSTER  |
2005 |
DBLP DOI BibTeX RDF |
|
| 2 | Geoffroy Vallée, Renaud Lottiaux, David Margery, Christine Morin |
Ghost Process: a Sound Basis to Implement Process Duplication, Migration and Checkpoint/Restart in Linux Clusters.  |
ISPDC  |
2005 |
DBLP DOI BibTeX RDF |
process virtualization, distributed system, operating system, Linux cluster, single system image |
| 2 | Shaya Potter, Jason Nieh |
WebPod: persistent Web browsing sessions with pocketable storage devices.  |
WWW  |
2005 |
DBLP DOI BibTeX RDF |
portable storage, virtualization, web browsing, process migration, checkpoint/restart |
| 2 | Jiannong Cao, Yinghao Li, Minyi Guo |
Process Migration for MPI Applications based on Coordinated Checkpoint.  |
ICPADS  |
2005 |
DBLP DOI BibTeX RDF |
MPI, process migration, checkpoint/restart, coordinated checkpoint |
| 2 | Adnan Agbaria, Roy Friedman |
Starfish: Fault-Tolerant Dynamic MPI Programs on Clusters of Workstations.  |
Cluster Computing  |
2003 |
DBLP DOI BibTeX RDF |
fault-tolerance, distributed system, MPI, high performance, checkpoint/restart |
| 2 | Kalman Z. Meth, William G. Tuel Jr. |
Parallel Checkpoint/Restart without Message Logging. (PDF / PS)  |
ICPP Workshops  |
2000 |
DBLP DOI BibTeX RDF |
|
| 2 | Andrea Clematis, Vittoria Gianuzzi |
CPVM - Extending PVM for Consistent Checkpointing.  |
PDP  |
1996 |
DBLP DOI BibTeX RDF |
CPVM, consistent checkpointing, global checkpoint-restart algorithms, job-swapping, parallel programming, software tools, concurrency control, migration, deadlocks, termination, software fault tolerance, software fault-tolerance, software libraries, software library, PVM, Parallel Virtual Machine, software portability, nonblocking |
| 2 | Vittoria Gianuzzi, F. Merani |
Using PVM to implement a distributed dependable simulation system.  |
PDP  |
1995 |
DBLP DOI BibTeX RDF |
distributed dependable simulation system, PVM routines, fault tolerant mechanisms, checkpoint-restart mechanism, distributed algorithms, distributed algorithms, fault tolerant computing, message passing, synchronisation, simulations modelling, Virtual Time, high speed interconnection |
| 1 | Devarshi Ghoshal, Sreesudhan R. Ramkumar, Arun Chauhan |
Distributed Speculative Parallelization using Checkpoint Restart.  |
Procedia CS  |
2011 |
DBLP DOI BibTeX RDF |
|
| 1 | Yawei Li, Zhiling Lan |
FREM: A Fast Restart Mechanism for General Checkpoint/Restart.  |
IEEE Trans. Computers  |
2011 |
DBLP DOI BibTeX RDF |
|
| 1 | |
Checkpoint/Restart.  |
Encyclopedia of Parallel Computing  |
2011 |
DBLP DOI BibTeX RDF |
|
| 1 | Raghunath Rajachandrasekar, Xiangyong Ouyang, Xavier Besseron, Vilobh Meshram, Dhabaleswar K. Panda |
Can Checkpoint/Restart Mechanisms Benefit from Hierarchical Data Staging?  |
Euro-Par Workshops  |
2011 |
DBLP DOI BibTeX RDF |
|
| 1 | Bogdan Nicolae, Franck Cappello |
BlobCR: efficient checkpoint-restart for HPC applications on IaaS clouds using virtual disk image snapshots.  |
SC  |
2011 |
DBLP DOI BibTeX RDF |
|
| 1 | Akira Nukada, Hiroyuki Takizawa, Satoshi Matsuoka |
NVCR: A Transparent Checkpoint-Restart Library for NVIDIA CUDA.  |
IPDPS Workshops  |
2011 |
DBLP DOI BibTeX RDF |
|
| 1 | Andrew G. Schmidt, Bin Huang, Ron Sass, Matthew French |
Checkpoint/Restart and Beyond: Resilient High Performance Computing with FPGAs.  |
FCCM  |
2011 |
DBLP DOI BibTeX RDF |
|
| 1 | Supada Laosooksathit, Nichamon Naksinehaboon, Chokchai Leangsuksun |
Two-level checkpoint/restart modeling for GPGPU.  |
AICCSA  |
2011 |
DBLP DOI BibTeX RDF |
|
| 1 | Joshua Hursey, Chris January, Mark O'Connor, Paul Hargrove, David Lecomber, Jeffrey M. Squyres, Andrew Lumsdaine |
Checkpoint/Restart-Enabled Parallel Debugging.  |
EuroMPI  |
2010 |
DBLP DOI BibTeX RDF |
|
| 1 | Mohamed-Slim Bouguerra, Thierry Gautier, Denis Trystram, Jean-Marc Vincent |
A Flexible Checkpoint/Restart Model in Distributed Systems.  |
PPAM  |
2009 |
DBLP DOI BibTeX RDF |
|
| 1 | Pierre Riteau, Adrien Lebre, Christine Morin |
Handling Persistent States in Process Checkpoint/Restart Mechanisms for HPC Systems.  |
CCGRID  |
2009 |
DBLP DOI BibTeX RDF |
|
| 1 | Can Ma, Zhigang Huo, Jingnan Cai, Dan Meng |
DCR: A fully transparent checkpoint/restart framework for distributed systems.  |
CLUSTER  |
2009 |
DBLP DOI BibTeX RDF |
|
| 1 | Hiroyuki Takizawa, Katsuto Sato, Kazuhiko Komatsu, Hiroaki Kobayashi |
CheCUDA: A Checkpoint/Restart Tool for CUDA Applications.  |
PDCAT  |
2009 |
DBLP DOI BibTeX RDF |
|
| 1 | Stelios Sidiroglou, Oren Laadan, Carlos Perez, Nicolas Viennot, Jason Nieh, Angelos D. Keromytis |
ASSURE: automatic software self-healing using rescue points.  |
ASPLOS  |
2009 |
DBLP DOI BibTeX RDF |
binary patching, chekpoint restart, reliable software, software self-healing, error recovery |
| 1 | Jason Ansel, Kapil Arya, Gene Cooperman |
DMTCP: Transparent checkpointing for cluster computations and the desktop.  |
IPDPS  |
2009 |
DBLP DOI BibTeX RDF |
|
| 1 | Sukadev Bhattiprolu, Eric W. Biederman, Serge E. Hallyn, Daniel Lezcano |
Virtual servers and checkpoint/restart in mainstream Linux.  |
Operating Systems Review  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Nichamon Naksinehaboon, Yudan Liu, Chokchai Leangsuksun, Raja Nassar, Mihaela Paun, Stephen L. Scott |
Reliability-Aware Approach: An Incremental Checkpoint/Restart Model in HPC Environments.  |
CCGRID  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Justin C. Y. Ho, Cho-Li Wang, Francis C. M. Lau |
Scalable group-based checkpoint/restart for large-scale message-passing systems.  |
IPDPS  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Borja Sotomayor, Kate Keahey, Ian T. Foster |
Combining batch execution and leasing using virtual machines.  |
HPDC  |
2008 |
DBLP DOI BibTeX RDF |
resource leasing, virtual machine overhead, virtual workspaces, virtual machines, resource management, advance reservations, batch processing, checkpoint/restart, backfilling |
| 1 | Satish Kharat, Rajeev Mishra, Ranadip Das, Srikanth Vishwanathan |
Migration of software partition in UNIX system.  |
Bangalore Compute Conf.  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Geoffroy Vallée, Kulathep Charoenpornwattana, Christian Engelmann, Anand Tikotekar, Chokchai Leangsuksun, Thomas Naughton, Stephen L. Scott |
A Framework for Proactive Fault Tolerance.  |
ARES  |
2008 |
DBLP DOI BibTeX RDF |
proactive fault tolerance, clustering, adaptation |
| 1 | Oren Laadan, Jason Nieh |
Transparent Checkpoint-Restart of Multiple Processes on Commodity Operating Systems.  |
USENIX Annual Technical Conference  |
2007 |
DBLP BibTeX RDF |
|
| 1 | Arun Babu Nagarajan, Frank Mueller, Christian Engelmann, Stephen L. Scott |
Proactive fault tolerance for HPC with Xen virtualization.  |
ICS  |
2007 |
DBLP DOI BibTeX RDF |
proactive fault tolerance, virtualization, high-performance computing |
| 1 | Daniele Paolo Scarpazza, Patrick Mullaney, Oreste Villa, Fabrizio Petrini, Vinod Tipparaju, D. M. L. Brown, Jarek Nieplocha |
Transparent system-level migration of PGAS applications using Xen on InfiniBand.  |
CLUSTER  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Anand Tikotekar, Geoffroy Vallée, Thomas Naughton, Stephen L. Scott, Chokchai Leangsuksun |
Evaluation of fault-tolerant policies using simulation.  |
CLUSTER  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Chao Wang, Frank Mueller, Christian Engelmann, Stephen L. Scott |
A Job Pause Service under LAM/MPI+BLCR for Transparent Fault Tolerance.  |
IPDPS  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Ron Oldfield, Sarala Arunagiri, Patricia J. Teller, Seetharami R. Seelam, Maria Ruiz Varela, Rolf Riesen, Philip C. Roth |
Modeling the Impact of Checkpoints on Next-Generation Systems.  |
MSST  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Panfeng Wang, Yunfei Du, Hongyi Fu, Haifang Zhou, Xuejun Yang, Wenjing Yang |
A Novel Fault-Tolerant Parallel Algorithm.  |
APPT  |
2007 |
DBLP DOI BibTeX RDF |
fault tolerance, parallel algorithm, high-performance computing |
| 1 | Rohit Fernandes, Keshav Pingali, Paul Stodghill |
Mobile MPI programs in computational grids.  |
PPOPP  |
2006 |
DBLP DOI BibTeX RDF |
over-decomposition, portable checkpointing, grid computing, MPI, heterogeneity, checkpoint/restart, application-level checkpointing |
| 1 | Tatsuya Ozaki, Tadashi Dohi, Hiroyuki Okamura, Naoto Kaio |
Distribution-Free Checkpoint Placement Algorithms Based on Min-Max Principle.  |
IEEE Trans. Dependable Sec. Comput.  |
2006 |
DBLP DOI BibTeX RDF |
incomplete failure information, performance evaluation, fault-tolerance, maintenance, high availability, modeling and prediction, Checkpoint/restart |
| 1 | Ricardo Marcelín-Jiménez, Sergio Rajsbaum, Brett Stevens |
Cyclic Storage for Fault-Tolerant Distributed Executions.  |
IEEE Trans. Parallel Distrib. Syst.  |
2006 |
DBLP DOI BibTeX RDF |
storage/repositories, network repositories/data mining/backup, fault-tolerance, distributed systems, distributed applications, checkpoint/restart, Load balancing and task assignment |
| 1 | Chao Huang, Gengbin Zheng, Laxmikant V. Kalé, Sameer Kumar |
Performance evaluation of adaptive MPI.  |
PPOPP  |
2006 |
DBLP DOI BibTeX RDF |
processor virtualization, adaptivity, load balancing, MPI, communication optimization |
| 1 | Shaya Potter, Jason Nieh |
Highly Reliable Mobile Desktop Computing in Your Pocket.  |
COMPSAC  |
2006 |
DBLP DOI BibTeX RDF |
|
| 1 | Francisco Fernández de Vega |
A Fault Tolerant Optimization Algorithm based on Evolutionary Computation.  |
DepCoS-RELCOMEX  |
2006 |
DBLP DOI BibTeX RDF |
|
| 1 | Hyuck Han, Jai Wug Kim, Jongpil Lee, Youngjin Yu, Kiyoung Kim, Heon Young Yeom |
Practical Fault-Tolerant Framework for eScience Infrastructure.  |
e-Science  |
2006 |
DBLP DOI BibTeX RDF |
|
| 1 | María Engracia Gómez, Nils Agne Nordbotten, Jose Flich, Pedro López, Antonio Robles, José Duato, Tor Skeie, Olav Lysne |
A Routing Methodology for Achieving Fault Tolerance in Direct Networks.  |
IEEE Trans. Computers  |
2006 |
DBLP DOI BibTeX RDF |
bubble flow control, Fault tolerance, adaptive routing, virtual channels, direct networks |
| 1 | Hatem Ltaief, Marc Garbey, Edgar Gabriel |
Parallel Fault Tolerant Algorithms for Parabolic Problems.  |
Euro-Par  |
2006 |
DBLP DOI BibTeX RDF |
|
| 1 | Sriram Sankaran, Jeffrey M. Squyres, Brian Barrett, Vishal Sahay, Andrew Lumsdaine, Jason Duell, Paul Hargrove, Eric Roman |
The Lam/Mpi Checkpoint/Restart Framework: System-Initiated Checkpointing.  |
IJHPCA  |
2005 |
DBLP DOI BibTeX RDF |
|
| 1 | Gladys Utrera, Julita Corbalán, Jesús Labarta |
Another approach to backfilled jobs: applying virtual malleability to expired windows.  |
ICS  |
2005 |
DBLP DOI BibTeX RDF |
|
| 1 | Kshitij Limaye, Box Leangsuksun, Venkata K. Munganuru, Zeno Greenwood, Stephen L. Scott, Richard Libby, Kasidit Chanchio |
Grid-Aware HA-OSCAR.  |
HPCS  |
2005 |
DBLP DOI BibTeX RDF |
|
| 1 | Shaya Potter, Jason Nieh |
AutoPod: Unscheduled System Updates with Zero Data Loss.  |
ICAC  |
2005 |
DBLP DOI BibTeX RDF |
|
| 1 | Adnan Agbaria, Roy Friedman |
A Replication- and Checkpoint-Based Approach for Anomaly-Based Intrusion Detection and Recovery.  |
ICDCS Workshops  |
2005 |
DBLP DOI BibTeX RDF |
|
| 1 | Matthieu Fertre, Christine Morin |
Extending a Cluster SSI OS for Transparently Checkpointing Message-Passing Parallel Application.  |
ISPAN  |
2005 |
DBLP DOI BibTeX RDF |
global coordination, checkpointing, parallel application, single system image |
| 1 | Limor Fix, Orna Grumberg, Amnon Heyman, Tamir Heyman, Assaf Schuster |
Verifying Very Large Industrial Circuits Using 100 Processes and Beyond.  |
ATVA  |
2005 |
DBLP DOI BibTeX RDF |
|
| 1 | Gracjan Jankowski, József Kovács, Norbert Meyer, Radoslaw Januszewski, Rafal Mikolajczak |
Towards Checkpointing Grid Architecture.  |
PPAM  |
2005 |
DBLP DOI BibTeX RDF |
|
| 1 | Pawel Czarnul, Marcin Fraczak |
New User-Guided and ckpt-Based Checkpointing Libraries for Parallel MPI Applications.  |
PVM/MPI  |
2005 |
DBLP DOI BibTeX RDF |
|
| 1 | Gengbin Zheng, Lixia Shi, Laxmikant V. Kalé |
FTC-Charm++: an in-memory checkpoint-based fault tolerant runtime for Charm++ and MPI.  |
CLUSTER  |
2004 |
DBLP DOI BibTeX RDF |
|
| 1 | Pawel Czarnul, Arkadiusz Urbaniak, Marcin Fraczak, Maciej Dyczkowski, Bartlomiej Balcerek |
Towards Easy-to-Use Checkpointing of MPI Applications within CLUSTERIX.  |
PARELEC  |
2004 |
DBLP DOI BibTeX RDF |
Process Checkpointing, Checkpointing Parallel Applications, Parallel Software Environments |
| 1 | E. N. Elnozahy, James S. Plank |
Checkpointing for Peta-Scale Systems: A Look into the Future of Practical Rollback-Recovery.  |
IEEE Trans. Dependable Sec. Comput.  |
2004 |
DBLP DOI BibTeX RDF |
fault tolerance, modeling, evaluation, Distributed systems, reliability, measurement, serviceability, availability, distributed applications, modeling techniques, performance of systems, simulation of multiple-processor systems |
| 1 | Adnan Agbaria, Roy Friedman |
Virtual Machine Based Heterogeneous Checkpointing. (PDF / PS)  |
IPDPS  |
2002 |
DBLP DOI BibTeX RDF |
|
| 1 | Peter Brezany, Viera Sipková |
Parallel I/O Support for HPF on Clusters.  |
CCGRID  |
2001 |
DBLP DOI BibTeX RDF |
|
| 1 | Miroslav Popovic, Vladimir Kovacevic, M. Skrbic |
Software Reliability and Maintenance Concept Used for Automatic Call Distributor MEDIO ACD.  |
ISSRE  |
2000 |
DBLP DOI BibTeX RDF |
fault-tolerant and robust software, software maintenance and software quality prediction |
| 1 | Adnan Agbaria, Roy Friedman |
Starfish: Fault-Tolerant Dynamic MPI Programs on Clusters of Workstations. (PDF / PS)  |
HPDC  |
1999 |
DBLP DOI BibTeX RDF |
|
| 1 | Lorenzo Alvisi, Keith Marzullo |
Message Logging: Pessimistic, Optimistic, Causal, and Optimal.  |
IEEE Trans. Software Eng.  |
1998 |
DBLP DOI BibTeX RDF |
pessimistic protocols, checkpoint-restart protocols, resilient processes, specification of fault-tolerance techniques, Message logging, optimistic protocols |
| 1 | Kent E. Seamons, Marianne Winslett |
An efficient abstract interface for multidimensional array I/O.  |
SC  |
1994 |
DBLP BibTeX RDF |
|
| 1 | Tom Barclay, Robert Barnes, Jim Gray, Prakash Sundaresan |
Loading Databases Using Dataflow Parallelism.  |
SIGMOD Record  |
1994 |
DBLP DOI BibTeX RDF |
|
| 1 | Bernd Baumgarten, Peter Ochsenschläger |
Modeling and verification of a checkpoint-restart-protocol.  |
Fehlertolerierende Rechensysteme  |
1984 |
DBLP BibTeX RDF |
|
| 1 | Silvia Pfleger |
Implementierte Checkpoint/Restart Fehlertoleranztechnik in der Praxis.  |
Software-Fehlertoleranz und -Zuverlässigkeit  |
1984 |
DBLP BibTeX RDF |
|
| 1 | Raymond A. Lorie |
Physical Integrity in a Large Segmented Database.  |
ACM Trans. Database Syst.  |
1977 |
DBLP DOI BibTeX RDF |
checkpoint-restart, database, recovery, storage management |
| 1 | A. B. Tonik |
Checkpoint, Restart, and Recovery: Selected Annotated Bibliography.  |
FDT - Bulletin of ACM SIGMOD  |
1975 |
DBLP BibTeX RDF |
|
Displaying result #1 - #80 of 80 (100 per page; Change: )
|
|