Hits |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
120 | Taesoon Park, Namyoon Woo, Heon Young Yeom |
An Efficient Optimistic Message Logging Scheme for Recoverable Mobile Computing Systems. |
IEEE Trans. Mob. Comput. |
2002 |
DBLP DOI BibTeX RDF |
asynchronous recovery, fault tolerance, Distributed systems, mobile computing, message logging |
115 | Lorenzo Alvisi, Karan Bhatia, Keith Marzullo |
Causality tracking in causal message-logging protocols. |
Distributed Comput. |
2002 |
DBLP DOI BibTeX RDF |
Causal logging, Causality tracking, Message logging |
112 | Luís Moura Silva, João Gabriel Silva |
Using Message Semantics for Fast-Output Commit in Checkpointing-and-Rollback Recovery. |
HICSS |
1999 |
DBLP DOI BibTeX RDF |
Checkpointing, Message-Logging, Message-passing systems, Crash-Recovery |
103 | Pierre Lemarinier, Aurélien Bouteiller, Thomas Hérault, Géraud Krawezik, Franck Cappello |
Improved message logging versus improved coordinated checkpointing for fault tolerant MPI. |
CLUSTER |
2004 |
DBLP DOI BibTeX RDF |
|
101 | Jinho Ahn |
2-step algorithm for enhancing effectiveness of sender-based message logging. |
SpringSim (2) |
2007 |
DBLP BibTeX RDF |
fault-tolerance, distributed systems, garbage collection, message logging |
100 | Thomas Ropars, Christine Morin |
Active Optimistic Message Logging for Reliable Execution of MPI Applications. |
Euro-Par |
2009 |
DBLP DOI BibTeX RDF |
|
89 | Lorenzo Alvisi, Keith Marzullo |
Message Logging: Pessimistic, Optimistic, Causal, and Optimal. |
IEEE Trans. Software Eng. |
1998 |
DBLP DOI BibTeX RDF |
pessimistic protocols, checkpoint-restart protocols, resilient processes, specification of fault-tolerance techniques, Message logging, optimistic protocols |
85 | |
Design, Analysis and Performance Evaluation of a New Algorithm for Developing a Fault Tolerant Distributed System. |
ICPADS (1) |
2006 |
DBLP DOI BibTeX RDF |
Critical interval, output commit, lost messages etc |
75 | Kwang-Sik Chung, Kibom Kim, Chong-Sun Hwang, Jin Gon Shon, Heon-Chang Yu |
Hybrid checkpointing protocol based on selective-sender-based message logging. |
ICPADS |
1997 |
DBLP DOI BibTeX RDF |
hybrid checkpointing protocol, selective-sender-based message logging, asynchronous checkpointing protocol, failure-free operation, cascade rollback, message dependency tree, search time, protocols, failure recovery |
72 | Aurélien Bouteiller, Boris Collin, Thomas Hérault, Pierre Lemarinier, Franck Cappello |
Impact of Event Logger on Causal Message Logging Protocols for Fault Tolerant MPI. |
IPDPS |
2005 |
DBLP DOI BibTeX RDF |
|
70 | E. N. Elnozahy, Willy Zwaenepoel |
Manetho: Transparent Rollback-Recovery with Low Overhead, Limited Rollback, and Fast Output Commit. |
IEEE Trans. Computers |
1992 |
DBLP DOI BibTeX RDF |
Manetho, transparent rollback-recovery protocol, antecedence graph maintenance, uncoordinated checkpointing, sender-based message logging, pessimistic message logging, output commit, optimistic message logging, failure-free overhead, distributed computations, graph theory, fault tolerant computing |
68 | Sriram Rao, Lorenzo Alvisi, Harrick M. Vin |
The Cost of Recovery in Message Logging Protocols. |
IEEE Trans. Knowl. Data Eng. |
2000 |
DBLP DOI BibTeX RDF |
log-based rollback recovery, pessimistic protocols, causal protocols, fault tolerance, Distributed computing, optimistic protocols, hybrid protocols |
66 | Ch. D. V. Subba Rao, M. M. Naidu |
A new, efficient coordinated checkpointing protocol combined with selective sender-based message logging. |
AICCSA |
2008 |
DBLP DOI BibTeX RDF |
|
66 | Inseon Lee, Heon Young Yeom, Taesoon Park, Hyoung-Woo Park |
A Lightweight Message Logging Scheme for Fault Tolerant MPI. |
PPAM |
2003 |
DBLP DOI BibTeX RDF |
|
64 | Kwang-Sik Chung, Heon-Chang Yu, Seongbin Park |
Garbage Collection in a Causal Message Logging Protocol. |
HPCC |
2005 |
DBLP DOI BibTeX RDF |
|
64 | Xinyu Chen, Michael R. Lyu |
Message Logging and Recovery in Wireless CORBA Using Access Bridge. |
ISADS |
2003 |
DBLP DOI BibTeX RDF |
Wireless CORBA, Fault tolerance, Mobile computing, Failure recovery, Message logging |
64 | Taesoon Park, Heon Young Yeom |
An Asynchronous Recovery Scheme based on Optimistic Message Logging for Mobile Computing Systems. |
ICDCS |
2000 |
DBLP DOI BibTeX RDF |
Asynchronous recovery, Fault-tolerance, Distributed systems, Mobile computing, Message logging |
63 | JinHo Ahn |
Checkpointing and Communication Pattern-Neutral Algorithm for Removing Messages Logged by Senders. |
HPCC |
2006 |
DBLP DOI BibTeX RDF |
fault-tolerance, garbage collection, checkpointing, message logging, message-passing system |
63 | Aurélien Bouteiller, Franck Cappello, Thomas Hérault, Géraud Krawezik, Pierre Lemarinier, Frédéric Magniette |
MPICH-V2: a Fault Tolerant MPI for Volatile Nodes based on Pessimistic Sender Based Message Logging. |
SC |
2003 |
DBLP DOI BibTeX RDF |
|
60 | Thomas Ropars, Christine Morin |
Fault Tolerance in Cluster Federations with O2P-CF. |
CCGRID |
2008 |
DBLP DOI BibTeX RDF |
Cluster federation, message passing application, fault tolerance, message logging |
57 | Karan Bhatia, Keith Marzullo, Lorenzo Alvisi |
Scalable Causal Message Logging for Wide-Area Environments. |
Euro-Par |
2001 |
DBLP DOI BibTeX RDF |
|
46 | Pierre Sens 0001 |
The performance of independent checkpointing in distributed systems. |
HICSS (2) |
1995 |
DBLP DOI BibTeX RDF |
independent checkpointing, run-time overhead, message logging mechanism, fault management overhead, long-running distributed applications, small data streams, performance evaluation, fault tolerance, performance, distributed systems, parallel processing, fault tolerant computing, distributed processing, Unix, local area networks, system recovery, workstations, parallel applications, workstations network |
46 | Kuo-Feng Ssu, Bin Yao, W. Kent Fuchs |
An Adaptive Checkpointing Protocol to Bound Recovery Time with Message Logging. |
SRDS |
1999 |
DBLP DOI BibTeX RDF |
|
46 | Robert H. B. Netzer, Yikang Xu |
Replaying Distributed Programs without Message Logging. |
HPDC |
1997 |
DBLP DOI BibTeX RDF |
|
46 | Zizhong Chen, Jack J. Dongarra |
Algorithm-Based Fault Tolerance for Fail-Stop Failures. |
IEEE Trans. Parallel Distributed Syst. |
2008 |
DBLP DOI BibTeX RDF |
|
46 | Jinmin Yang, Dafang Zhang, Zheng Qin 0001, Xue Dong Yang |
WINDAR: A Multithreaded Rollback-Recovery Toolkit on Windows. |
PRDC |
2004 |
DBLP DOI BibTeX RDF |
|
44 | Esteban Meneses, Laxmikant V. Kalé, Greg Bronevetsky |
Dynamic Load Balance for Optimized Message Logging in Fault Tolerant HPC Applications. |
CLUSTER |
2011 |
DBLP DOI BibTeX RDF |
causal message logging, fault tolerance, load balancing |
43 | Zunce Wei, Hon Fung Li, Dhrubajyoti Goswami |
A Locality-Driven Atomic Group Checkpoint Protocol. |
PDCAT |
2006 |
DBLP DOI BibTeX RDF |
|
41 | Hugo Meyer, Ronal Muresano, Marcela Castro-León, Dolores Rexachs, Emilio Luque |
Hybrid Message Pessimistic Logging. Improving current pessimistic message logging protocols. |
J. Parallel Distributed Comput. |
2017 |
DBLP DOI BibTeX RDF |
|
40 | Bin Yao, W. Kent Fuchs |
Message Logging Optimization for Wireless Networks. |
SRDS |
2001 |
DBLP DOI BibTeX RDF |
|
40 | Bin Yao, Kuo-Feng Ssu, W. Kent Fuchs |
Message Logging in Mobile Computing. |
FTCS |
1999 |
DBLP DOI BibTeX RDF |
|
40 | Angkul Kongmunvattana, Nian-Feng Tzeng |
Coherence-Centric Logging and Recovery for Home-Based Software Distributed Shared Memory. |
ICPP |
1999 |
DBLP DOI BibTeX RDF |
|
37 | Aurélien Bouteiller, Pierre Lemarinier, Géraud Krawezik, Franck Cappello |
Coordinated Checkpoint versus Message Log for Fault Tolerant MPI. |
CLUSTER |
2003 |
DBLP DOI BibTeX RDF |
Fault tolerant MPI, performance, message log, coordinated checkpoint |
35 | Pierre Sens 0001 |
Performance Evaluation of Fault Tolerance for Parallel Applications in Networked Environments. |
ICPP |
1997 |
DBLP DOI BibTeX RDF |
fault-tolerance, performances, distributed systems, checkpointing, message logging |
35 | JinHo Ahn, Sung-Gi Min, Chong-Sun Hwang |
Consistent and Efficient Recovery for Causal Message Logging. |
ICOIN (2) |
2002 |
DBLP DOI BibTeX RDF |
|
35 | JinHo Ahn, Sung-Gi Min, Chong-Sun Hwang |
Low-Cost Garbage Collection for Causal Message Logging. |
HiPC |
2001 |
DBLP DOI BibTeX RDF |
|
35 | Kalman Z. Meth, William G. Tuel Jr. |
Parallel Checkpoint/Restart without Message Logging. |
ICPP Workshops |
2000 |
DBLP DOI BibTeX RDF |
|
35 | Shahnaz Afroz, Hee Yong Youn, Dongman Lee |
Performance of Message Logging Protocols for NOWs with MPI. |
PRDC |
1999 |
DBLP DOI BibTeX RDF |
|
34 | Sayantan Chakravorty, Laxmikant V. Kalé |
A Fault Tolerance Protocol with Fast Fault Recovery. |
IPDPS |
2007 |
DBLP DOI BibTeX RDF |
|
34 | JinHo Ahn, Chong-Sun Hwang |
Low-Cost Fault-Tolerance for Mobile Nodes in Mobile IP Based Systems. |
ICDCS Workshops |
2001 |
DBLP DOI BibTeX RDF |
|
33 | Thomas Ropars, Christine Morin |
Improving Message Logging Protocols Scalability through Distributed Event Logging. |
Euro-Par (1) |
2010 |
DBLP DOI BibTeX RDF |
|
32 | Nomica Imran, Imran Rao, Young-Koo Lee, Sungyoung Lee |
A proxy-based uncoordinated checkpointing scheme with pessimistic message logging for mobile grid systems. |
HPDC |
2007 |
DBLP DOI BibTeX RDF |
mobile grid systems, fault tolerance, checkpointing |
31 | Angkul Kongmunvattana, Nian-Feng Tzeng |
Logging and Recovery in Adaptive Software Distributed Shared Memory Systems. |
SRDS |
1999 |
DBLP DOI BibTeX RDF |
|
28 | David B. Lomet, Gerhard Weikum |
Efficient and Transparent Application Recovery in Client-Server Information Systems. |
SIGMOD Conference |
1998 |
DBLP DOI BibTeX RDF |
|
27 | Jinho Ahn |
N Fault-Tolerant Sender-Based Message Logging for Group Communication-Based Message Passing Systems. |
CSE |
2014 |
DBLP DOI BibTeX RDF |
|
27 | Thomas Ropars, Christine Morin |
Active optimistic and distributed message logging for message-passing applications. |
Concurr. Comput. Pract. Exp. |
2011 |
DBLP DOI BibTeX RDF |
|
27 | Yi-Wei Ci, Zhan Zhang, De-Cheng Zuo, Xiao-Zong Yang |
Message fragment based causal message logging. |
J. Parallel Distributed Comput. |
2009 |
DBLP DOI BibTeX RDF |
|
27 | Mehdi Aminian, Mohammad K. Akbari, Bahman Javadi |
Coordinated checkpoint from message payload in pessimistic sender-based message logging. |
IPDPS |
2006 |
DBLP DOI BibTeX RDF |
|
27 | JinHo Ahn, Chong-Sun Hwang |
Efficient Garbage Collection Schemes for Causal Message Logging with Independent Checkpointing in Message Passing Systems. |
IPDPS |
2001 |
DBLP DOI BibTeX RDF |
|
27 | Robert H. B. Netzer, Sairam Subramanian, Jian Xu |
Critical-Path-Based Message Logging for incremental Replay of Message-Passing Programs. |
ICDCS |
1994 |
DBLP DOI BibTeX RDF |
|
27 | Hong Va Leong, Divyakant Agrawal |
Using Message Semantics to Reduce Rollback in Optimistic Message Logging Recovery Schemes. |
ICDCS |
1994 |
DBLP DOI BibTeX RDF |
|
27 | Robert H. B. Netzer, Jian Xu |
Adaptive message logging for incremental replay of message-passing programs. |
SC |
1993 |
DBLP DOI BibTeX RDF |
DEBUG |
27 | Yi-Min Wang, W. Kent Fuchs |
Optimistic Message Logging for Independent Checkpointing in Message-Passing Systems. |
SRDS |
1992 |
DBLP DOI BibTeX RDF |
|
26 | Namyoon Woo, Hyungsoo Jung 0001, Dongin Shin, Hyuck Han, Heon Young Yeom, Taesoon Park |
Performance Evaluation of Consistent Recovery Protocols Using MPICH-GF. |
EDCC |
2005 |
DBLP DOI BibTeX RDF |
|
23 | Chandreyee Chowdhury, Sarmistha Neogy |
A Consistent Checkpointing-Recovery Protocol for Minimal Number of Nodes in Mobile Computing System. |
HiPC |
2007 |
DBLP DOI BibTeX RDF |
consistency, Checkpointing, Recovery, message logging, Mobile computing system |
23 | E. N. Elnozahy, Lorenzo Alvisi, Yi-Min Wang, David B. Johnson 0001 |
A survey of rollback-recovery protocols in message-passing systems. |
ACM Comput. Surv. |
2002 |
DBLP DOI BibTeX RDF |
rollback-recovery, message logging |
23 | Masato Kitakami, Shunji Kubota, Hideo Ito |
Fault-Tolerance of Functional Programs Based on the Parallel Graph Reduction. |
PRDC |
2001 |
DBLP DOI BibTeX RDF |
Referential transparency, Fault tolerance, functional programming, message logging, graph reduction |
23 | Taesoon Park, Heon Young Yeom |
A Low Overhead Logging Scheme for Fast Recovery in Distributed Shared Memory Systems. |
J. Supercomput. |
2000 |
DBLP DOI BibTeX RDF |
checkpointing, rollback-recovery, fault tolerant system, message logging, distributed shared memory system |
23 | Bina Ramamurthy, Shambhu J. Upadhyaya, Bharat K. Bhargava |
Design and Analysis of an Integrated Checkpointing Recovery Scheme for Distributed Applications. |
IEEE Trans. Knowl. Data Eng. |
2000 |
DBLP DOI BibTeX RDF |
distributed systems, Checkpointing, concurrent error detection, rollback recovery, message logging |
23 | Sriram Rao, Lorenzo Alvisi, Harrick M. Vin |
Egida: An Extensible Toolkit for Low-Overhead Fault-Tolerance. |
FTCS |
1999 |
DBLP DOI BibTeX RDF |
transparent fault-tolerance, middleware, MPI, checkpointing, message-logging |
23 | J. Hamilton Slye, E. N. Elnozahy |
Support for Software Interrupts in Log-Based Rollback-Recovery. |
IEEE Trans. Computers |
1998 |
DBLP DOI BibTeX RDF |
instruction counters, distributed systems, Checkpointing, rollback-recovery, message logging |
23 | Yunjung Yi, Taesoon Park, Heon Young Yeom |
A Causal Logging Scheme for Lazy Release Consistent Distributed Shared Memory Systems. |
ICPADS |
1998 |
DBLP DOI BibTeX RDF |
Lazy release consistency, Fault tolerance, Checkpointing, Rollback-recovery, Message logging, Distributed shared memory system |
23 | Subbarayan Venkatesan, Tong-Ying Tony Juang, Sridhar Alagar |
Optimistic Crash Recovery without Changing Application Messages. |
IEEE Trans. Parallel Distributed Syst. |
1997 |
DBLP DOI BibTeX RDF |
fail-stop failures, optimistic message logging, distributed algorithms, time complexity, message complexity, Crash recovery |
23 | Zizhong Chen |
Extending algorithm-based fault tolerance to tolerate fail-stop failures in high performance distributed environments. |
IPDPS |
2008 |
DBLP DOI BibTeX RDF |
|
23 | Rui Wang 0002, Betty Salzberg, David B. Lomet |
Log-based recovery for middleware servers. |
SIGMOD Conference |
2007 |
DBLP DOI BibTeX RDF |
application fault tolerance, exactly-once execution, optimistic logging, distributed systems, recovery |
23 | Partha Sarathi Mandal 0001, Krishnendu Mukhopadhyaya |
Estimating Checkpointing, Rollback and Recovery Overheads. |
IWDC |
2003 |
DBLP DOI BibTeX RDF |
|
20 | Jinho Ahn |
Efficient Sender-Based Message Logging Tolerating Simultaneous Failures with Always No Rollback Property. |
Symmetry |
2023 |
DBLP DOI BibTeX RDF |
|
20 | Kiril Dichev, Daniele De Sensi, Dimitrios S. Nikolopoulos, Kirk W. Cameron, Ivor T. A. Spence |
Power Log'n'Roll: Power-Efficient Localized Rollback for MPI Applications Using Message Logging Protocols. |
IEEE Trans. Parallel Distributed Syst. |
2022 |
DBLP DOI BibTeX RDF |
|
20 | Jinho Ahn |
Enhanced Sender-Based Message Logging for Reducing Forced Checkpointing Overhead in Distributed Systems. |
IEICE Trans. Inf. Syst. |
2021 |
DBLP DOI BibTeX RDF |
|
20 | Jinho Ahn |
Scalable Sender-Based Message Logging Protocol with Little Communication Overhead for Distributed Systems. |
Parallel Process. Lett. |
2019 |
DBLP DOI BibTeX RDF |
|
20 | Kiril Dichev, Dimitrios S. Nikolopoulos |
Implementing Efficient Message Logging Protocols as MPI Application Extensions. |
CoRR |
2019 |
DBLP BibTeX RDF |
|
20 | Nuria Losada, George Bosilca, Aurélien Bouteiller, Patricia González, María J. Martín |
Local rollback for resilient MPI applications with application-level checkpointing and message logging. |
Future Gener. Comput. Syst. |
2019 |
DBLP DOI BibTeX RDF |
|
20 | Kiril Dichev, Dimitrios S. Nikolopoulos |
Implementing efficient message logging protocols as MPI application extensions. |
EuroMPI |
2019 |
DBLP DOI BibTeX RDF |
|
20 | Jinho Ahn |
Hybrid Message Logging Protocol with Little Overhead for Two-Level Hierarchical and Distributed Architectures. |
IEICE Trans. Inf. Syst. |
2018 |
DBLP DOI BibTeX RDF |
|
20 | Jinho Ahn |
Broadcast Network-Based Sender Based Message Logging for Overcoming Multiple Failures. |
IEICE Trans. Inf. Syst. |
2017 |
DBLP DOI BibTeX RDF |
|
20 | Esteban Meneses |
Exploring Application-Level Message-Logging in Scalable HPC Programs. |
CARLA |
2017 |
DBLP DOI BibTeX RDF |
|
20 | Esteban Meneses |
Reducing the Overhead of Message Logging in Fault-Tolerant HPC Applications. |
CARLA |
2016 |
DBLP DOI BibTeX RDF |
|
20 | Jinmin Yang |
A Lightweight Causal Message Logging Protocol to Lower Fault Tolerance Overhead. |
CLUSTER |
2016 |
DBLP DOI BibTeX RDF |
|
20 | Esteban Meneses, Laxmikant V. Kalé |
Camel: collective-aware message logging. |
J. Supercomput. |
2015 |
DBLP DOI BibTeX RDF |
|
20 | Parmeet Kaur Jaggi, Awadhesh Kumar Singh |
Movement-Based Checkpointing and Message Logging for Recovery in MANETs. |
Wirel. Pers. Commun. |
2015 |
DBLP DOI BibTeX RDF |
|
20 | Tatiana V. Martsinkevich, Thomas Ropars, Franck Cappello |
Addressing the Last Roadblock for Message Logging in HPC: Alleviating the Memory Requirement Using Dedicated Resources. |
Euro-Par Workshops |
2015 |
DBLP DOI BibTeX RDF |
|
20 | Md. Tarikul Islam, Hien Nguyen 0004, Jaspal Subhlok, Edgar Gabriel |
Efficient Message Logging to Support Process Replicas in a Volunteer Computing Environment. |
IPDPS Workshops |
2015 |
DBLP DOI BibTeX RDF |
|
20 | Jinho Ahn |
Group Sender-based Message Logging Protocol for Conquering Simultaneous Failures. |
ICADIWT |
2015 |
DBLP DOI BibTeX RDF |
|
20 | Hideyuki Jitsumoto, Yuki Todoroki, Yutaka Ishikawa, Mitsuhisa Sato |
Grid-Oriented Process Clustering System for Partial Message Logging. |
DSN |
2014 |
DBLP DOI BibTeX RDF |
|
20 | Hugo Meyer, Dolores Rexachs, Emilio Luque |
Hybrid Message Logging. Combining advantages of Sender-based and Receiver-based Approaches. |
ICCS |
2014 |
DBLP DOI BibTeX RDF |
|
20 | Jonathan Lifflander, Esteban Meneses, Harshitha Menon, Phil Miller, Sriram Krishnamoorthy, Laxmikant V. Kalé |
Scalable replay with partial-order dependencies for message-logging fault tolerance. |
CLUSTER |
2014 |
DBLP DOI BibTeX RDF |
|
20 | Aurélien Bouteiller, Thomas Hérault, George Bosilca, Jack J. Dongarra |
Correlated set coordination in fault tolerant message logging protocols for many-core clusters. |
Concurr. Comput. Pract. Exp. |
2013 |
DBLP DOI BibTeX RDF |
|
20 | Jinho Ahn |
On Reducing Rollback Propagation Effect of Optimistic Message Logging for Group-Based Distributed Systems. |
IEICE Trans. Inf. Syst. |
2013 |
DBLP DOI BibTeX RDF |
|
20 | Xunyun Liu, Xinhai Xu, Xiaoguang Ren, Yuhua Tang, Ziqing Dai |
A Message Logging Protocol Based on User Level Failure Mitigation. |
ICA3PP (1) |
2013 |
DBLP DOI BibTeX RDF |
|
20 | Jinho Ahn |
An effective method for optimistic message logging based on group communication links. |
IRI |
2013 |
DBLP DOI BibTeX RDF |
|
20 | Esteban Meneses |
Scalable message-logging techniques for effective fault tolerance in HPC applications |
|
2013 |
RDF |
|
20 | Yi Luo 0002, D. Manivannan 0001 |
HOPE: A Hybrid Optimistic checkpointing and selective Pessimistic mEssage logging protocol for large scale distributed systems. |
Future Gener. Comput. Syst. |
2012 |
DBLP DOI BibTeX RDF |
|
20 | Esteban Meneses, Xiang Ni, Laxmikant V. Kalé |
A message-logging protocol for multicore systems. |
DSN Workshops |
2012 |
DBLP DOI BibTeX RDF |
|
20 | Bidyut Gupta, Ruslan Nikolaev 0001, Raja Chirra |
A Recovery Scheme for Cluster Federations Using Sender-based Message Logging. |
J. Comput. Inf. Technol. |
2011 |
DBLP DOI BibTeX RDF |
|
20 | Jinho Ahn |
Lightweight Consistent Recovery Algorithm for Sender-Based Message Logging in Distributed Systems. |
IEICE Trans. Inf. Syst. |
2011 |
DBLP DOI BibTeX RDF |
|
20 | Aurélien Bouteiller, Thomas Hérault, George Bosilca, Jack J. Dongarra |
Correlated Set Coordination in Fault Tolerant Message Logging Protocols. |
Euro-Par (2) |
2011 |
DBLP DOI BibTeX RDF |
|
20 | Thomas Ropars, Amina Guermouche, Bora Uçar, Esteban Meneses, Laxmikant V. Kalé, Franck Cappello |
On the Use of Cluster-Based Partial Message Logging to Improve Fault Tolerance for MPI HPC Applications. |
Euro-Par (1) |
2011 |
DBLP DOI BibTeX RDF |
|
20 | Esteban Meneses, Greg Bronevetsky, Laxmikant V. Kalé |
Evaluation of Simple Causal Message Logging for Large-Scale Fault Tolerant HPC Systems. |
IPDPS Workshops |
2011 |
DBLP DOI BibTeX RDF |
|
20 | Jinho Ahn |
Effective sender-based message logging algorithm with checkpointing considering transient communication errors. |
HPCS |
2011 |
DBLP DOI BibTeX RDF |
|
20 | Yi-Wei Ci, Zhan Zhang, De-Cheng Zuo, Zhibo Wu, Xiao-Zong Yang |
Dependency mining-based causal message logging. |
Inf. Process. Lett. |
2010 |
DBLP DOI BibTeX RDF |
|