Hits ?▲ |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
25 | Carlos Hernández-López, Fernando Pérez Costoya |
Modeling Multithreaded Object Oriented Applications for Rollback Recovery. |
Parallel and Distributed Computing and Networks |
2005 |
DBLP BibTeX RDF |
|
25 | Namyoon Woo, Hyungsoo Jung 0001, Heon Young Yeom, Taesoon Park, Hyungwoo Park |
MPICH-GF: Transparent Checkpointing and Rollback-Recovery for Grid-Enabled MPI Processes. |
IEICE Trans. Inf. Syst. |
2004 |
DBLP BibTeX RDF |
|
25 | MaengSoon Baik, SungJin Choi, Chong-Sun Hwang, Joon-Min Gil, Chan Yeol Park, HeonChang Yoo |
Ω Line Problem in Optimistic Log-Based Rollback Recovery Protocol. |
IEICE Trans. Inf. Syst. |
2004 |
DBLP BibTeX RDF |
|
25 | Taha Osman, Waleed Wagealla, Andrzej Bargiela |
An Approach to Rollback Recovery of Collaborating Mobile Agents. |
IEEE Trans. Syst. Man Cybern. Part C |
2004 |
DBLP DOI BibTeX RDF |
|
25 | Oscar R. Gonzcilez, Arturo Tejada, W. Steven Gray |
Analysis of design trade-offs in the rollback recovery method for fault tolerant digital control systems. |
ACC |
2002 |
DBLP DOI BibTeX RDF |
|
25 | P. Emerald Chung, Woei-Jyh Lee, Yennun Huang, Deron Liang, Chung-Yih Wang |
Winckp: A Transparent Checkpointing and Rollback Recovery Tool for Windows NT Applications. |
FTCS |
1999 |
DBLP DOI BibTeX RDF |
|
25 | Dhiraj K. Pradhan, Nitin H. Vaidya |
Roll-Forward and Rollback Recovery: Performance-Reliability Trade-Off. |
IEEE Trans. Computers |
1997 |
DBLP DOI BibTeX RDF |
|
25 | Viren Shah, Sandeepan Sanyal, Samrat Bhattacharya |
Deadlocks in fully uncoordinated checkpointing rollback recovery systems. |
WORDS |
1997 |
DBLP DOI BibTeX RDF |
|
25 | Xinfeng Ye, John A. Keane |
Concurrent checkpointing and rollback recovery for distributed systems. |
EUROSIM |
1996 |
DBLP BibTeX RDF |
|
25 | Viral Shah, Sourav Bhattacharya |
Fault Message Propagation and its Impact on Rollback Recovery Mechanisms. |
PDPTA |
1996 |
DBLP BibTeX RDF |
|
25 | Neal J. Alewine, Shyh-Kwei Chen, W. Kent Fuchs, Wen-mei W. Hwu |
Compiler-Assisted Multiple Instruction Rollback Recovery Using a Read Buffer. |
IEEE Trans. Computers |
1995 |
DBLP DOI BibTeX RDF |
Fault-tolerance, compilers, error recovery, instruction retry |
25 | Bob Janssens, W. Kent Fuchs |
Ensuring Correct Rollback Recovery in Distributed Shared Memory Systems. |
J. Parallel Distributed Comput. |
1995 |
DBLP DOI BibTeX RDF |
|
25 | Gaurav Suri, Bob Janssens, W. Kent Fuchs |
Reduced Overhead Logging for Rollback Recovery in Distributed Shared Memory. |
FTCS |
1995 |
DBLP DOI BibTeX RDF |
|
25 | E. N. Elnozahy |
On the Relevance of Communication Costs of Rollback-Recovery Protocols. |
PODC |
1995 |
DBLP DOI BibTeX RDF |
|
25 | Yong Deng, E. K. Park |
Checkpointing and rollback-recovery algorithms in distributed systems. |
J. Syst. Softw. |
1994 |
DBLP DOI BibTeX RDF |
|
25 | Dhiraj K. Pradhan, Nitin H. Vaidya |
Roll-Forward and Rollback Recovery: Performance-Reliability Trade-Off. |
FTCS |
1994 |
DBLP DOI BibTeX RDF |
|
25 | Nicholas S. Bowen, Dhiraj K. Pradhan |
Processor- and Memory-Based Checkpoint and Rollback Recovery. |
Computer |
1993 |
DBLP DOI BibTeX RDF |
|
25 | W. Kent Fuchs, Wen-mei W. Hwu, Neal J. Alewine |
Application of Compiler-Assisted Rollback Recovery to Speculative Execution Repair. |
Hardware and Software Architectures for Fault Tolerance |
1993 |
DBLP DOI BibTeX RDF |
|
25 | David B. Johnson 0001 |
Efficient Transparent Optimistic Rollback Recovery for Distributed Application Programs. |
SRDS |
1993 |
DBLP DOI BibTeX RDF |
|
25 | Neal J. Alewine |
Compiler-Assisted Multiple Instruction Rollback Recovery Using a Read Buffer |
|
1993 |
RDF |
|
25 | Jiannong Cao 0001, K. C. Wang |
An Abstract Model of Rollback Recovery Control in Distributed Systems. |
ACM SIGOPS Oper. Syst. Rev. |
1992 |
DBLP DOI BibTeX RDF |
|
25 | David B. Johnson 0001, Willy Zwaenepoel |
Transparent Optimistic Rollback Recovery. |
ACM SIGOPS Oper. Syst. Rev. |
1991 |
DBLP DOI BibTeX RDF |
|
25 | Nicholas S. Bowen, Dhiraj K. Pradhan |
A virtual memory translation mechanism to support checkpoint and rollback recovery. |
SC |
1991 |
DBLP DOI BibTeX RDF |
|
25 | Chung-Chi Li |
Compiler-assisted rollback recovery |
|
1991 |
RDF |
|
25 | William Anthony Manzo |
Performance evaluation of checkpoint rollback recovery algorithms in distributed systems |
|
1991 |
RDF |
|
25 | Luke Lin, Mustaque Ahamad |
Checkpointing and rollback-recovery in distributed object based systems. |
FTCS |
1990 |
DBLP DOI BibTeX RDF |
|
25 | Bharat K. Bhargava, Shy-Renn Lian, Pei-Jyun Leu |
Experimental Evaluation of Concurrency Checkpointing and Rollback-Recovery Algorithms. |
ICDE |
1990 |
DBLP DOI BibTeX RDF |
|
25 | Shambhu J. Upadhyaya |
Rollback recovery in real-time systems with dynamic constraints. |
COMPSAC |
1990 |
DBLP DOI BibTeX RDF |
|
25 | David B. Johnson 0001, Willy Zwaenepoel |
Transparent optimistic rollback recovery. |
ACM SIGOPS European Workshop |
1990 |
DBLP DOI BibTeX RDF |
|
25 | Kun-Lung Wu |
Memory management and rollback recovery in parallel architectures |
|
1990 |
RDF |
|
25 | Zhijun Tong, Richard Y. Kain, Wei-Tek Tsai |
A Low Overhead Checkpointing and Rollback Recovery Scheme for Distributed Systems. |
SRDS |
1989 |
DBLP DOI BibTeX RDF |
|
25 | Shambhu J. Upadhyaya, Kewal K. Saluja |
An experimental study to determine task size for rollback recovery systems. |
IEEE Trans. Computers |
1988 |
DBLP DOI BibTeX RDF |
|
25 | Parameswaran Ramanathan, Kang G. Shin |
Checkpointing and Rollback Recovery in a Distributed System Using Common Time Base. |
SRDS |
1988 |
DBLP DOI BibTeX RDF |
|
25 | K. Venkatesh, Thiruvengadam Radhakrishnan, Hon Fung Li |
Optimal Checkpointing and Local Recording for Domino-Free Rollback Recovery. |
Inf. Process. Lett. |
1987 |
DBLP DOI BibTeX RDF |
|
25 | Richard Koo, Sam Toueg |
Checkpointing and Rollback-Recovery for Distributed Systems. |
IEEE Trans. Software Eng. |
1987 |
DBLP DOI BibTeX RDF |
|
25 | L. Lehmann, J. Brehm |
Rollback Recovery in Multiprocessor Ring Configurations. |
Fehlertolerierende Rechensysteme |
1987 |
DBLP DOI BibTeX RDF |
|
25 | Richard Koo, Sam Toueg |
Checkpointing and Rollback-Recovery for Distributed Systems. |
FJCC |
1986 |
DBLP BibTeX RDF |
|
25 | Andrzej Duda |
Performance Analysis of the Checkpoint-Rollback-Recovery System via Diffusion Approximation. |
Computer Performance and Reliability |
1983 |
DBLP BibTeX RDF |
|
25 | A. M. Feridun, Kang G. Shin |
A Fault-Tolerant Multiprocessor System with Rollback Recovery Capabilities. |
ICDCS |
1981 |
DBLP BibTeX RDF |
|
25 | Erol Gelenbe, D. Derochette |
Performance of Rollback Recovery Systems under Intermittent Failures. |
Commun. ACM |
1978 |
DBLP DOI BibTeX RDF |
|
24 | Sachin Garg, Yennun Huang, Chandra M. R. Kintala, Kishor S. Trivedi |
Minimizing Completion Time of a Program by Checkpointing and Rejuvenation. |
SIGMETRICS |
1996 |
DBLP DOI BibTeX RDF |
|
24 | Nianen Chen, Shangping Ren |
Adaptive optimal checkpoint interval and its impact on system's overall quality in soft real-time applications. |
SAC |
2009 |
DBLP DOI BibTeX RDF |
checkpoint rollback recovery, system overall quality, optimization, soft real-time systems |
24 | Ting Chen, Yongjian Wang, Yuanqiang Huang, Cheng Luo, Depei Qian, Zhongzhi Luan |
A Two-Phase Log-Based Fault Recovery Mechanism in Master/Worker Based Computing Environment. |
ISPA |
2009 |
DBLP DOI BibTeX RDF |
log-based rollback recovery, two-phase recovery, Drug Discovery Grid, fault recovery |
24 | Joshua Hursey, Timothy Mattox, Andrew Lumsdaine |
Interconnect agnostic checkpoint/restart in open MPI. |
HPDC |
2009 |
DBLP DOI BibTeX RDF |
checkpoint coordination protocol, fault tolerance, MPI, shared memory, rollback-recovery, infiniband, myrinet, high speed interconnect, checkpoint/restart |
24 | Chaoguang Men, Zhenpeng Xu, Dongsheng Wang 0002 |
An Efficient Handoff Strategy for Mobile Computing Checkpoint System. |
EUC |
2007 |
DBLP DOI BibTeX RDF |
fault tolerant, mobile computing, checkpoint, handoff, rollback recovery |
24 | Rodrigo Schmidt, Islene C. Garcia, Fernando Pedone, Luiz Eduardo Buzato |
Optimal Asynchronous Garbage Collection for RDT Checkpointing Protocols. |
ICDCS |
2005 |
DBLP DOI BibTeX RDF |
rollback-dependency trackability, garbage collection, rollback-recovery, distributed checkpointing |
24 | Rodrigo Schmidt, Islene C. Garcia, Fernando Pedone, Luiz Eduardo Buzato |
Brief announcement: optimal asynchronous garbage collection for checkpointing protocols with rollback-dependency trackability. |
PODC |
2004 |
DBLP DOI BibTeX RDF |
rollback-dependency trackability, garbage collection, rollback-recovery, distributed checkpointing |
24 | Kuo-Feng Ssu, W. Kent Fuchs, Hewijin Christine Jiau |
Process Recovery in Heterogeneous Systems. |
IEEE Trans. Computers |
2003 |
DBLP DOI BibTeX RDF |
portable checkpointing, Heterogeneous systems, rollback recovery, process migration |
24 | Jichiang Tsai 0001 |
On Properties of RDT Communication-Induced Checkpointing Protocols. |
IEEE Trans. Parallel Distributed Syst. |
2003 |
DBLP DOI BibTeX RDF |
rollback-dependency trackability, communication-induced checkpointing protocols, fault tolerance, Distributed systems, rollback-recovery |
24 | Chi-Yi Lin, Szu-Chi Wang, Sy-Yen Kuo |
An Efficient Time-Based Checkpointing Protocol for Mobile Computing Systems over Mobile IP. |
Mob. Networks Appl. |
2003 |
DBLP DOI BibTeX RDF |
checkpointing and rollback-recovery, fault tolerance, mobile computing |
24 | Florin Sultan, Thu D. Nguyen, Liviu Iftode |
Lazy Garbage Collection of Recovery State for Fault-Tolerant Distributed Shared Memory. |
IEEE Trans. Parallel Distributed Syst. |
2002 |
DBLP DOI BibTeX RDF |
log-based rollback recovery, Fault tolerance, garbage collection, checkpointing, distributed shared memory |
24 | Florin Sultan, Thu D. Nguyen, Liviu Iftode |
Lazy Garbage Collection of Recovery State for Fault-Tolerant Distributed Shared Memory. |
IEEE Trans. Parallel Distributed Syst. |
2002 |
DBLP DOI BibTeX RDF |
log-based rollback recovery, Fault tolerance, garbage collection, checkpointing, distributed shared memory |
24 | Tadashi Dohi, Naoto Kaio, Kishor S. Trivedi |
Availability Models with Age-Dependent Checkpointing. |
SRDS |
2002 |
DBLP DOI BibTeX RDF |
age-dependent model, approximation, availability, checkpoint, file system, rollback recovery |
24 | Francesco Quaglia |
A Cost Model for Selecting Checkpoint Positions in Time Warp Parallel Simulation. |
IEEE Trans. Parallel Distributed Syst. |
2001 |
DBLP DOI BibTeX RDF |
checkpointing, cost models, performance optimization, time warp, rollback-recovery, Parallel discrete-event simulation, optimistic synchronization |
24 | Islene C. Garcia, Luiz Eduardo Buzato |
On the Minimal Characterization of the Rollback-Dependency Trackability Property. |
ICDCS |
2001 |
DBLP DOI BibTeX RDF |
zigzag paths, fault-tolerance, Distributed algorithms, rollback recovery, distributed checkpointing |
24 | Taesoon Park, Heon Young Yeom |
A Low Overhead Logging Scheme for Fast Recovery in Distributed Shared Memory Systems. |
J. Supercomput. |
2000 |
DBLP DOI BibTeX RDF |
checkpointing, rollback-recovery, fault tolerant system, message logging, distributed shared memory system |
24 | Bina Ramamurthy, Shambhu J. Upadhyaya, Bharat K. Bhargava |
Design and Analysis of an Integrated Checkpointing Recovery Scheme for Distributed Applications. |
IEEE Trans. Knowl. Data Eng. |
2000 |
DBLP DOI BibTeX RDF |
distributed systems, Checkpointing, concurrent error detection, rollback recovery, message logging |
24 | Sriram Rao, Lorenzo Alvisi, Harrick M. Vin |
The Cost of Recovery in Message Logging Protocols. |
IEEE Trans. Knowl. Data Eng. |
2000 |
DBLP DOI BibTeX RDF |
log-based rollback recovery, pessimistic protocols, causal protocols, fault tolerance, Distributed computing, optimistic protocols, hybrid protocols |
24 | Katsuya Tanaka, Makoto Takizawa 0001 |
Asynchronous Checkpointing Protocol for Object-Based Systems. |
ISORC |
2000 |
DBLP DOI BibTeX RDF |
Distributed Object-based System, Fault-Tolerant, Group communication, Rollback Recovery, Asynchronous protocol |
24 | Katsuya Tanaka, Makoto Takizawa 0001 |
Checkpointing Protocol for Object-Based Systems. |
ICPADS |
2000 |
DBLP DOI BibTeX RDF |
communication-induced protocol, fault-tolerant, checkpoint, rollback recovery, object-based system |
24 | Francesco Quaglia, Vittorio Cortellessa, Bruno Ciciani |
Trade-Off between Sequential and Time Warp-Based Parallel Simulation. |
IEEE Trans. Parallel Distributed Syst. |
1999 |
DBLP DOI BibTeX RDF |
performance evaluation, discrete event simulation, Time Warp, rollback-recovery, parallel and distributed simulation, Virtual time |
24 | Roberto Baldoni, Francesco Quaglia, Paolo Fornara |
An Index-Based Checkpointing Algorithm for Autonomous Distributed Systems. |
IEEE Trans. Parallel Distributed Syst. |
1999 |
DBLP DOI BibTeX RDF |
timestamp management, global snapshot, performance evaluation, fault tolerance, distributed systems, protocols, Checkpointing, rollback-recovery, causal dependency |
24 | Nitin H. Vaidya |
Staggered Consistent Checkpointing. |
IEEE Trans. Parallel Distributed Syst. |
1999 |
DBLP DOI BibTeX RDF |
Staggered checkpoints, consistent recovery line, stable storage contention, fault tolerance, rollback recovery |
24 | Jean-Michel Hélary, Robert H. B. Netzer, Michel Raynal |
Consistency Issues in Distributed Checkpoints. |
IEEE Trans. Software Eng. |
1999 |
DBLP DOI BibTeX RDF |
transitlessness, fault-tolerance, distributed systems, consistency, Checkpointing, rollback recovery, strong consistency |
24 | Lorenzo Alvisi, E. N. Elnozahy, Sriram Rao, Syed Amir Husain, Asanka De Mel |
An Analysis of Communication Induced Checkpointing. |
FTCS |
1999 |
DBLP DOI BibTeX RDF |
Performance Evaluation, MPI, Checkpointing, Rollback Recovery, Consistent Global States |
24 | William R. Dieter, James E. Lumpp Jr. |
A User-Level Checkpointing Library for POSIX Threads Programs. |
FTCS |
1999 |
DBLP DOI BibTeX RDF |
multithreaded, Unix, checkpointing, threads, rollback recovery, Solaris |
24 | Hyosoon Lee, Heonshik Shin, Sang Lyul Min |
Worst Case Timing Requirement of Real-Time Tasks with Time Redundancy. |
RTCSA |
1999 |
DBLP DOI BibTeX RDF |
Fault Tolerance, Real-Time Scheduling, Rollback Recovery, Time Redundancy |
24 | James S. Plank, Kai Li 0001, Michael A. Puening |
Diskless Checkpointing. |
IEEE Trans. Parallel Distributed Syst. |
1998 |
DBLP DOI BibTeX RDF |
memory redundancy, RAID systems, Fault tolerance, error-correcting codes, checkpointing, rollback recovery, copy-on-write |
24 | Jichiang Tsai 0001, Sy-Yen Kuo, Yi-Min Wang |
Theoretical Analysis for Communication-Induced Checkpointing Protocols with Rollback-Dependency Trackability. |
IEEE Trans. Parallel Distributed Syst. |
1998 |
DBLP DOI BibTeX RDF |
Rollback-dependency trackability, communication-induced protocols, checkpointing, on-line algorithms, rollback recovery |
24 | Yunjung Yi, Taesoon Park, Heon Young Yeom |
A Causal Logging Scheme for Lazy Release Consistent Distributed Shared Memory Systems. |
ICPADS |
1998 |
DBLP DOI BibTeX RDF |
Lazy release consistency, Fault tolerance, Checkpointing, Rollback-recovery, Message logging, Distributed shared memory system |
24 | Yi-Min Wang, Yennun Huang, W. Kent Fuchs, Chandra M. R. Kintala, Gaurav Suri |
Progressive Retry for Software Failure Recovery in Message-Passing Applications. |
IEEE Trans. Computers |
1997 |
DBLP DOI BibTeX RDF |
message reordering, recovery escalation, Fault tolerance, distributed systems, protocols, checkpointing, logging, rollback recovery, telecommunication systems |
24 | Erwin Duschnig, Reinhold Weiss |
Design of a Distributed Fault-Tolerant Computer Architecture Applied to the Traffic Control System "IVMS". |
ISPAN |
1996 |
DBLP DOI BibTeX RDF |
distributed artificially intelligent system, fault tolerance, rollback recovery |
24 | Nicholas S. Bowen, Dhiraj K. Pradhan |
A Fault Tolerant Hybrid Memory Structure and Memory Management Algorithms. |
IEEE Trans. Computers |
1995 |
DBLP DOI BibTeX RDF |
Checkpoint and rollback recovery, hybrid memory, fault tolerance, memory management, virtual memory |
24 | Chung-Chi Jim Li, Shyh-Kwei Chen, W. Kent Fuchs, Wen-mei W. Hwu |
Compiler-Based Multiple Instruction Retry. |
IEEE Trans. Computers |
1995 |
DBLP DOI BibTeX RDF |
compilers, fault-tolerant computing, rollback recovery, instruction retry |
24 | Nicholas S. Bowen, Dhiraj K. Pradhan |
Virtual Checkpoints: Architecture and Performance. |
IEEE Trans. Computers |
1992 |
DBLP DOI BibTeX RDF |
virtual checkpoints, virtual memory translation hardware, performance evaluation, performance analysis, fault tolerant computing, trace-driven simulation, rollback recovery, failure tolerance, address space |
24 | Kun-Lung Wu, W. Kent Fuchs |
Recoverable Distributed Shared Virtual Memory. |
IEEE Trans. Computers |
1990 |
DBLP DOI BibTeX RDF |
distributed shared virtual environments, loosely coupled distributed multicomputer system, user-transparent checkpointing recovery scheme, twin-page disk storage management technique, memory coherence protocol, distributed processing, storage management, virtual memory, rollback recovery, virtual storage |
24 | Asser N. Tantawi, Manfred Ruschitzka |
Performance Analysis of Checkpointing Strategies |
ACM Trans. Comput. Syst. |
1984 |
DBLP DOI BibTeX RDF |
equicost checkpointing strategy, equidistant checkpointing strategy, performance modeling and optimization, error recovery, rollback recovery, system availability, database recovery |
24 | Yann-Hang Lee, Kang G. Shin |
Design and Evaluation of a Fault-Tolerant Multiprocessur Using Hardware Recovery Blocks. |
IEEE Trans. Computers |
1984 |
DBLP DOI BibTeX RDF |
rollback propagation, hardware/ software recovery blocks, performance of rollback recovery mechanisms, Fault-tolerant multiprocessor |
22 | Chandreyee Chowdhury, Sarmistha Neogy |
A Consistent Checkpointing-Recovery Protocol for Minimal Number of Nodes in Mobile Computing System. |
HiPC |
2007 |
DBLP DOI BibTeX RDF |
consistency, Checkpointing, Recovery, message logging, Mobile computing system |
22 | Ravi Prakash 0001, Mukesh Singhal |
Low-Cost Checkpointing and Failure Recovery in Mobile Computing Systems. |
IEEE Trans. Parallel Distributed Syst. |
1996 |
DBLP DOI BibTeX RDF |
global snapshot, Checkpointing, recovery, portable computers, mobile computing systems, causal dependency |
18 | Gabriel Rodríguez 0001, Patricia González, María J. Martín, Juan Touriño |
Enhancing Fault-Tolerance of Large-Scale MPI Scientific Applications. |
PaCT |
2007 |
DBLP DOI BibTeX RDF |
Fault tolerance, MPI, checkpointing, parallel applications |
18 | D. Manivannan 0001, Qiangfeng Jiang, Jianchang Yang, Karl E. Persson, Mukesh Singhal |
An Asynchronous Recovery Algorithm Based on a Staggered Quasi-Synchronous Checkpointing Algorithm. |
IWDC |
2005 |
DBLP DOI BibTeX RDF |
|
18 | Michael R. Lyu, Xinyu Chen, Tsz Yeung Wong |
Design and Evaluation of a Fault-Tolerant Mobile-Agent System. |
IEEE Intell. Syst. |
2004 |
DBLP DOI BibTeX RDF |
|
18 | Gyung-Leen Park, Hee Yong Youn, Hyunseung Choo |
Optimal Checkpoint Interval Analysis Using Stochastic Petri Net. |
PRDC |
2001 |
DBLP DOI BibTeX RDF |
|
18 | Vítor N. Távora, Luís Moura Silva, João Gabriel Silva |
Distributed Checkpointing Mechanism for a Parallel File System. |
PVM/MPI |
2000 |
DBLP DOI BibTeX RDF |
fault-tolerance and reliability, file checkpointing, extensions and improvements to pvm, checkpointing, parallel i/o |
18 | Yi-Min Wang, Pi-Yu Chung, In-Jen Lin, W. Kent Fuchs |
Checkpoint Space Reclamation for Uncoordinated Checkpointing in Message-Passing Systems.. |
IEEE Trans. Parallel Distributed Syst. |
1995 |
DBLP DOI BibTeX RDF |
|
14 | Alireza Ejlali, Bashir M. Al-Hashimi, Petru Eles |
A standby-sparing technique with low energy-overhead for fault-tolerant hard real-time systems. |
CODES+ISSS |
2009 |
DBLP DOI BibTeX RDF |
reliability, energy minimization, hard real-time systems |
14 | Petru Eles, Viacheslav Izosimov, Paul Pop, Zebo Peng |
Synthesis of Fault-Tolerant Embedded Systems. |
DATE |
2008 |
DBLP DOI BibTeX RDF |
|
14 | Hideyuki Jitsumoto, Toshio Endo, Satoshi Matsuoka |
Environmental-aware optimization of MPI checkpointing intervals. |
CLUSTER |
2008 |
DBLP DOI BibTeX RDF |
|
14 | Partha Sarathi Mandal 0001, Krishnendu Mukhopadhyaya |
Checkpointing Using Mobile Agents in Distributed Systems. |
ICCTA |
2007 |
DBLP DOI BibTeX RDF |
|
14 | José Carlos Mouriño, María J. Martín, Patricia González, Ramon Doallo |
Fault-tolerant solutions for a MPI compute intensive application. |
PDP |
2007 |
DBLP DOI BibTeX RDF |
|
14 | Qi Gao 0004, Wei Huang 0003, Matthew J. Koop, Dhabaleswar K. Panda 0001 |
Group-based Coordinated Checkpointing for MPI: A Case Study on InfiniBand. |
ICPP |
2007 |
DBLP DOI BibTeX RDF |
|
14 | Ying Zhang 0041, Krishnendu Chakrabarty |
A unified approach for fault tolerance and dynamic power management in fixed-priority real-time embedded systems. |
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. |
2006 |
DBLP DOI BibTeX RDF |
|
14 | Angelo Duarte, Dolores Rexachs, Emilio Luque |
An Intelligent Management of Fault Tolerance in Cluster Using RADICMPI. |
PVM/MPI |
2006 |
DBLP DOI BibTeX RDF |
|
14 | Zizhong Chen, Jack J. Dongarra |
Algorithm-based checkpoint-free fault tolerance for parallel matrix computations on volatile resources. |
IPDPS |
2006 |
DBLP DOI BibTeX RDF |
|
14 | Qi Gao 0004, Weikuan Yu, Wei Huang 0003, Dhabaleswar K. Panda 0001 |
Application-Transparent Checkpoint/Restart for MPI Programs over InfiniBand. |
ICPP |
2006 |
DBLP DOI BibTeX RDF |
|
14 | Viacheslav Izosimov, Paul Pop, Petru Eles, Zebo Peng |
Synthesis of Fault-Tolerant Embedded Systems with Checkpointing and Replication. |
DELTA |
2006 |
DBLP DOI BibTeX RDF |
|
14 | Angelo Duarte, Dolores Rexachs, Emilio Luque |
Increasing the cluster availability using RADIC. |
CLUSTER |
2006 |
DBLP DOI BibTeX RDF |
|
14 | Youhui Zhang, Dongsheng Wong, Weimin Zheng |
User-level checkpoint and recovery for LAM/MPI. |
ACM SIGOPS Oper. Syst. Rev. |
2005 |
DBLP DOI BibTeX RDF |
Linux |
14 | Naoki Kobayashi 0005, Tadashi Dohi |
Bayesian Perspective of Optimal Checkpoint Placement. |
HASE |
2005 |
DBLP DOI BibTeX RDF |
|