GrowBag graphs for keyword (Num. hits/coverage)
The graphs summarize 108 occurrences of 76 keywords
Results
Found 163 publication records. Showing 163 according to the selection in the facets.
Hits |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
151 | Satish Kharat, Rajeev Mishra, Ranadip Das, Srikanth Vishwanathan |
Migration of software partition in UNIX system. |
Bangalore Compute Conf. ![In: Proceedings of the 1st Bangalore Annual Compute Conference, Compute 2008, Bangalore, India, January 18-20, 2008, pp. 22, 2008, ACM, 978-1-59593-950-0. The full citation details ...](Pics/full.jpeg) |
2008 |
DBLP DOI BibTeX RDF |
|
143 | Joshua Hursey, Timothy Mattox, Andrew Lumsdaine |
Interconnect agnostic checkpoint/restart in open MPI. |
HPDC ![In: Proceedings of the 18th ACM International Symposium on High Performance Distributed Computing, HPDC 2009, Garching, Germany, June 11-13, 2009, pp. 49-58, 2009, ACM, 978-1-60558-587-1. The full citation details ...](Pics/full.jpeg) |
2009 |
DBLP DOI BibTeX RDF |
checkpoint coordination protocol, fault tolerance, MPI, shared memory, rollback-recovery, infiniband, myrinet, high speed interconnect, checkpoint/restart |
127 | Oren Laadan, Dan B. Phung, Jason Nieh |
Transparent Checkpoint-Restart of Distributed Applications on Commodity Clusters. |
CLUSTER ![In: 2005 IEEE International Conference on Cluster Computing (CLUSTER 2005), September 26 - 30, 2005, Boston, Massachusetts, USA, pp. 1-13, 2005, IEEE Computer Society, 0-7803-9485-2. The full citation details ...](Pics/full.jpeg) |
2005 |
DBLP DOI BibTeX RDF |
|
124 | Chao Wang 0056, Frank Mueller 0001, Christian Engelmann, Stephen L. Scott |
A Job Pause Service under LAM/MPI+BLCR for Transparent Fault Tolerance. |
IPDPS ![In: 21st International Parallel and Distributed Processing Symposium (IPDPS 2007), Proceedings, 26-30 March 2007, Long Beach, California, USA, pp. 1-10, 2007, IEEE. The full citation details ...](Pics/full.jpeg) |
2007 |
DBLP DOI BibTeX RDF |
|
123 | Oreste Villa, Sriram Krishnamoorthy, Jarek Nieplocha, David M. Brown Jr. |
Scalable transparent checkpoint-restart of global address space applications on virtual machines over infiniband. |
Conf. Computing Frontiers ![In: Proceedings of the 6th Conference on Computing Frontiers, 2009, Ischia, Italy, May 18-20, 2009, pp. 197-206, 2009, ACM, 978-1-60558-413-3. The full citation details ...](Pics/full.jpeg) |
2009 |
DBLP DOI BibTeX RDF |
checkpoint-restart, virtual machines, infiniband, global address space |
108 | G. John Janakiraman, Jose Renato Santos, Dinesh Subhraveti, Yoshio Turner |
Cruz: Application-Transparent Distributed Checkpoint-Restart on Standard Operating Systems. |
DSN ![In: 2005 International Conference on Dependable Systems and Networks (DSN 2005), 28 June - 1 July 2005, Yokohama, Japan, Proceedings, pp. 260-269, 2005, IEEE Computer Society, 0-7695-2282-3. The full citation details ...](Pics/full.jpeg) |
2005 |
DBLP DOI BibTeX RDF |
|
107 | Yudan Liu, Raja Nassar, Chokchai Leangsuksun, Nichamon Naksinehaboon, Mihaela Paun, Stephen L. Scott |
An optimal checkpoint/restart model for a large scale high performance computing system. |
IPDPS ![In: 22nd IEEE International Symposium on Parallel and Distributed Processing, IPDPS 2008, Miami, Florida, USA, April 14-18, 2008, pp. 1-9, 2008, IEEE. The full citation details ...](Pics/full.jpeg) |
2008 |
DBLP DOI BibTeX RDF |
|
107 | Yudan Liu, Raja Nassar, Chokchai Leangsuksun, Nichamon Naksinehaboon, Mihaela Paun, Stephen L. Scott |
A reliability-aware approach for an optimal checkpoint/restart model in HPC environments. |
CLUSTER ![In: Proceedings of the 2007 IEEE International Conference on Cluster Computing, 17-20 September 2007, Austin, Texas, USA, pp. 452-457, 2007, IEEE Computer Society, 978-1-4244-1387-4. The full citation details ...](Pics/full.jpeg) |
2007 |
DBLP DOI BibTeX RDF |
|
105 | Gengbin Zheng, Lixia Shi, Laxmikant V. Kalé |
FTC-Charm++: an in-memory checkpoint-based fault tolerant runtime for Charm++ and MPI. |
CLUSTER ![In: 2004 IEEE International Conference on Cluster Computing (CLUSTER 2004), September 20-23 2004, San Diego, California, USA, pp. 93-103, 2004, IEEE Computer Society, 0-7803-8694-9. The full citation details ...](Pics/full.jpeg) |
2004 |
DBLP DOI BibTeX RDF |
|
101 | Jiannong Cao 0001, Yinghao Li, Minyi Guo |
Process Migration for MPI Applications based on Coordinated Checkpoint. |
ICPADS (1) ![In: 11th International Conference on Parallel and Distributed Systems, ICPADS 2005, Fukuoka, Japan, July 20-22, 2005, pp. 306-312, 2005, IEEE Computer Society, 0-7695-2281-5. The full citation details ...](Pics/full.jpeg) |
2005 |
DBLP DOI BibTeX RDF |
MPI, process migration, checkpoint/restart, coordinated checkpoint |
95 | Gracjan Jankowski, József Kovács, Norbert Meyer, Radoslaw Januszewski, Rafal Mikolajczak |
Towards Checkpointing Grid Architecture. |
PPAM ![In: Parallel Processing and Applied Mathematics, 6th International Conference, PPAM 2005, Poznan, Poland, September 11-14, 2005, Revised Selected Papers, pp. 659-666, 2005, Springer, 3-540-34141-2. The full citation details ...](Pics/full.jpeg) |
2005 |
DBLP DOI BibTeX RDF |
|
89 | Joshua Hursey, Jeffrey M. Squyres, Timothy Mattox, Andrew Lumsdaine |
The Design and Implementation of Checkpoint/Restart Process Fault Tolerance for Open MPI. |
IPDPS ![In: 21st International Parallel and Distributed Processing Symposium (IPDPS 2007), Proceedings, 26-30 March 2007, Long Beach, California, USA, pp. 1-8, 2007, IEEE. The full citation details ...](Pics/full.jpeg) |
2007 |
DBLP DOI BibTeX RDF |
|
89 | R. Badrinath, R. Krishnakumar, R. K. Palanivel Rajan |
Virtualization aware job schedulers for checkpoint-restart. |
ICPADS ![In: 13th International Conference on Parallel and Distributed Systems, ICPADS 2007, Hsinchu, Taiwan, December 5-7, 2007, pp. 1-7, 2007, IEEE Computer Society, 978-1-4244-1889-3. The full citation details ...](Pics/full.jpeg) |
2007 |
DBLP DOI BibTeX RDF |
|
70 | Qi Gao 0004, Weikuan Yu, Wei Huang 0003, Dhabaleswar K. Panda 0001 |
Application-Transparent Checkpoint/Restart for MPI Programs over InfiniBand. |
ICPP ![In: 2006 International Conference on Parallel Processing (ICPP 2006), 14-18 August 2006, Columbus, Ohio, USA, pp. 471-478, 2006, IEEE Computer Society, 0-7695-2636-5. The full citation details ...](Pics/full.jpeg) |
2006 |
DBLP DOI BibTeX RDF |
|
70 | José Carlos Sancho, Fabrizio Petrini, Kei Davis, Roberto Gioiosa, Song Jiang 0001 |
Current Practice and a Direction Forward in Checkpoint/Restart Implementations for Fault Tolerance. |
IPDPS ![In: 19th International Parallel and Distributed Processing Symposium (IPDPS 2005), CD-ROM / Abstracts Proceedings, 4-8 April 2005, Denver, CO, USA, 2005, IEEE Computer Society, 0-7695-2312-9. The full citation details ...](Pics/full.jpeg) |
2005 |
DBLP DOI BibTeX RDF |
|
70 | Kalman Z. Meth, William G. Tuel Jr. |
Parallel Checkpoint/Restart without Message Logging. |
ICPP Workshops ![In: Proceedings of the 2000 International Workshop on Parallel Processing, ICPPW 2000, Toronto, Canada, August 21-24, 2000, pp. 253-258, 2000, IEEE Computer Society, 0-7695-0771-9. The full citation details ...](Pics/full.jpeg) |
2000 |
DBLP DOI BibTeX RDF |
|
67 | Xiangyong Ouyang, Raghunath Rajachandrasekar, Xavier Besseron, Hao Wang 0002, Jian Huang 0006, Dhabaleswar K. Panda 0001 |
CRFS: A Lightweight User-Level Filesystem for Generic Checkpoint/Restart. |
ICPP ![In: International Conference on Parallel Processing, ICPP 2011, Taipei, Taiwan, September 13-16, 2011, pp. 375-384, 2011, IEEE Computer Society, 978-1-4577-1336-1. The full citation details ...](Pics/full.jpeg) |
2011 |
DBLP DOI BibTeX RDF |
checkpoint-restart, userspace filesystem, write aggregation |
66 | Jason Ansel, Kapil Arya, Gene Cooperman |
DMTCP: Transparent checkpointing for cluster computations and the desktop. |
IPDPS ![In: 23rd IEEE International Symposium on Parallel and Distributed Processing, IPDPS 2009, Rome, Italy, May 23-29, 2009, pp. 1-12, 2009, IEEE. The full citation details ...](Pics/full.jpeg) |
2009 |
DBLP DOI BibTeX RDF |
|
60 | Geoffroy Vallée, Renaud Lottiaux, David Margery, Christine Morin |
Ghost Process: a Sound Basis to Implement Process Duplication, Migration and Checkpoint/Restart in Linux Clusters. |
ISPDC ![In: 4th International Symposium on Parallel and Distributed Computing (ISPDC 2005), 4-6 July 2005, Lille, France, pp. 97-104, 2005, IEEE Computer Society, 0-7695-2434-6. The full citation details ...](Pics/full.jpeg) |
2005 |
DBLP DOI BibTeX RDF |
process virtualization, distributed system, operating system, Linux cluster, single system image |
60 | Yudan Liu, Chokchai Leangsuksun, Hertong Song, Stephen L. Scott |
Reliability-aware Checkpoint/Restart Scheme: A Performability Trade-off. |
CLUSTER ![In: 2005 IEEE International Conference on Cluster Computing (CLUSTER 2005), September 26 - 30, 2005, Boston, Massachusetts, USA, pp. 1-8, 2005, IEEE Computer Society, 0-7803-9485-2. The full citation details ...](Pics/full.jpeg) |
2005 |
DBLP DOI BibTeX RDF |
|
58 | Adnan Agbaria, Roy Friedman |
Virtual Machine Based Heterogeneous Checkpointing. |
IPDPS ![In: 16th International Parallel and Distributed Processing Symposium (IPDPS 2002), 15-19 April 2002, Fort Lauderdale, FL, USA, CD-ROM/Abstracts Proceedings, 2002, IEEE Computer Society, 0-7695-1573-8. The full citation details ...](Pics/full.jpeg) |
2002 |
DBLP DOI BibTeX RDF |
|
54 | Stephen L. Scott, Christian Engelmann, Geoffroy Vallée, Thomas J. Naughton, Anand Tikotekar, George Ostrouchov, Chokchai Leangsuksun, Nichamon Naksinehaboon, Raja Nassar, Mihaela Paun, Frank Mueller 0001, Chao Wang 0056, Arun Babu Nagarajan, Jyothish Varma |
A tunable holistic resiliency approach for high-performance computing systems. |
PPoPP ![In: Proceedings of the 14th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP 2009, Raleigh, NC, USA, February 14-18, 2009, pp. 305-306, 2009, ACM, 978-1-60558-397-6. The full citation details ...](Pics/full.jpeg) |
2009 |
DBLP DOI BibTeX RDF |
preemptive migration, fault tolerance, high-performance computing, resilience, checkpoint/restart |
54 | Shaya Potter, Jason Nieh |
WebPod: persistent Web browsing sessions with pocketable storage devices. |
WWW ![In: Proceedings of the 14th international conference on World Wide Web, WWW 2005, Chiba, Japan, May 10-14, 2005, pp. 603-612, 2005, ACM, 1-59593-046-9. The full citation details ...](Pics/full.jpeg) |
2005 |
DBLP DOI BibTeX RDF |
portable storage, virtualization, web browsing, process migration, checkpoint/restart |
54 | Adnan Agbaria, Roy Friedman |
Starfish: Fault-Tolerant Dynamic MPI Programs on Clusters of Workstations. |
Clust. Comput. ![In: Clust. Comput. 6(3), pp. 227-236, 2003. The full citation details ...](Pics/full.jpeg) |
2003 |
DBLP DOI BibTeX RDF |
fault-tolerance, distributed system, MPI, high performance, checkpoint/restart |
54 | Andrea Clematis, Vittoria Gianuzzi |
CPVM - Extending PVM for Consistent Checkpointing. |
PDP ![In: 4th Euromicro Workshop on Parallel and Distributed Processing (PDP '96), January 24-26, 1996, Portugal, pp. 67-74, 1996, IEEE Computer Society, 0-8186-7376-1. The full citation details ...](Pics/full.jpeg) |
1996 |
DBLP DOI BibTeX RDF |
CPVM, consistent checkpointing, global checkpoint-restart algorithms, job-swapping, parallel programming, software tools, concurrency control, migration, deadlocks, termination, software fault tolerance, software fault-tolerance, software libraries, software library, PVM, Parallel Virtual Machine, software portability, nonblocking |
54 | Vittoria Gianuzzi, F. Merani |
Using PVM to implement a distributed dependable simulation system. |
PDP ![In: 3rd Euromicro Workshop on Parallel and Distributed Processing (PDP '95), January 25-27, 1995, San Remo, Italy, pp. 529-537, 1995, IEEE Computer Society, 0-8186-7031-2. The full citation details ...](Pics/full.jpeg) |
1995 |
DBLP DOI BibTeX RDF |
distributed dependable simulation system, PVM routines, fault tolerant mechanisms, checkpoint-restart mechanism, distributed algorithms, fault tolerant computing, message passing, synchronisation, simulations modelling, Virtual Time, high speed interconnection |
48 | Yawei Li, Zhiling Lan |
FREM: A Fast Restart Mechanism for General Checkpoint/Restart. |
IEEE Trans. Computers ![In: IEEE Trans. Computers 60(5), pp. 639-652, 2011. The full citation details ...](Pics/full.jpeg) |
2011 |
DBLP DOI BibTeX RDF |
|
48 | Kathryn M. Mohror, Adam Moody, Bronis R. de Supinski |
Asynchronous checkpoint migration with MRNet in the Scalable Checkpoint / Restart Library. |
DSN Workshops ![In: IEEE/IFIP International Conference on Dependable Systems and Networks Workshops, DSN 2012, Boston, MA, USA, June 25-28, 2012, pp. 1-6, 2012, IEEE Computer Society, 978-1-4673-2264-5. The full citation details ...](Pics/full.jpeg) |
2012 |
DBLP DOI BibTeX RDF |
|
47 | Arun Babu Nagarajan, Frank Mueller 0001, Christian Engelmann, Stephen L. Scott |
Proactive fault tolerance for HPC with Xen virtualization. |
ICS ![In: Proceedings of the 21st Annual International Conference on Supercomputing, ICS 2007, Seattle, Washington, USA, June 17-21, 2007, pp. 23-32, 2007, ACM, 978-1-59593-768-1. The full citation details ...](Pics/full.jpeg) |
2007 |
DBLP DOI BibTeX RDF |
proactive fault tolerance, virtualization, high-performance computing |
47 | Matthieu Fertre, Christine Morin |
Extending a Cluster SSI OS for Transparently Checkpointing Message-Passing Parallel Application. |
ISPAN ![In: 8th International Symposium on Parallel Architectures, Algorithms, and Networks, ISPAN 2005, December 7-9, 2005, Las Vegas, Nevada, USA, pp. 364-369, 2005, IEEE Computer Society, 0-7695-2509-1. The full citation details ...](Pics/full.jpeg) |
2005 |
DBLP DOI BibTeX RDF |
global coordination, checkpointing, parallel application, single system image |
38 | Limor Fix, Orna Grumberg, Amnon Heyman, Tamir Heyman, Assaf Schuster |
Verifying Very Large Industrial Circuits Using 100 Processes and Beyond. |
ATVA ![In: Automated Technology for Verification and Analysis, Third International Symposium, ATVA 2005, Taipei, Taiwan, October 4-7, 2005, Proceedings, pp. 11-25, 2005, Springer, 3-540-29209-8. The full citation details ...](Pics/full.jpeg) |
2005 |
DBLP DOI BibTeX RDF |
|
35 | Borja Sotomayor, Kate Keahey, Ian T. Foster |
Combining batch execution and leasing using virtual machines. |
HPDC ![In: Proceedings of the 17th International Symposium on High-Performance Distributed Computing (HPDC-17 2008), 23-27 June 2008, Boston, MA, USA, pp. 87-96, 2008, ACM, 978-1-59593-997-5. The full citation details ...](Pics/full.jpeg) |
2008 |
DBLP DOI BibTeX RDF |
resource leasing, virtual machine overhead, virtual workspaces, virtual machines, resource management, advance reservations, batch processing, checkpoint/restart, backfilling |
35 | Ricardo Marcelín-Jiménez, Sergio Rajsbaum, Brett Stevens |
Cyclic Storage for Fault-Tolerant Distributed Executions. |
IEEE Trans. Parallel Distributed Syst. ![In: IEEE Trans. Parallel Distributed Syst. 17(9), pp. 1028-1036, 2006. The full citation details ...](Pics/full.jpeg) |
2006 |
DBLP DOI BibTeX RDF |
storage/repositories, network repositories/data mining/backup, fault-tolerance, distributed systems, distributed applications, checkpoint/restart, Load balancing and task assignment |
35 | Tatsuya Ozaki, Tadashi Dohi, Hiroyuki Okamura, Naoto Kaio |
Distribution-Free Checkpoint Placement Algorithms Based on Min-Max Principle. |
IEEE Trans. Dependable Secur. Comput. ![In: IEEE Trans. Dependable Secur. Comput. 3(2), pp. 130-140, 2006. The full citation details ...](Pics/full.jpeg) |
2006 |
DBLP DOI BibTeX RDF |
incomplete failure information, performance evaluation, fault-tolerance, maintenance, high availability, modeling and prediction, Checkpoint/restart |
35 | Rohit Fernandes, Keshav Pingali, Paul Stodghill |
Mobile MPI programs in computational grids. |
PPoPP ![In: Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP 2006, New York, New York, USA, March 29-31, 2006, pp. 22-31, 2006, ACM, 1-59593-189-9. The full citation details ...](Pics/full.jpeg) |
2006 |
DBLP DOI BibTeX RDF |
over-decomposition, portable checkpointing, grid computing, MPI, heterogeneity, checkpoint/restart, application-level checkpointing |
35 | Lorenzo Alvisi, Keith Marzullo |
Message Logging: Pessimistic, Optimistic, Causal, and Optimal. |
IEEE Trans. Software Eng. ![In: IEEE Trans. Software Eng. 24(2), pp. 149-159, 1998. The full citation details ...](Pics/full.jpeg) |
1998 |
DBLP DOI BibTeX RDF |
pessimistic protocols, checkpoint-restart protocols, resilient processes, specification of fault-tolerance techniques, Message logging, optimistic protocols |
35 | Raymond A. Lorie |
Physical Integrity in a Large Segmented Database. |
ACM Trans. Database Syst. ![In: ACM Trans. Database Syst. 2(1), pp. 91-104, 1977. The full citation details ...](Pics/full.jpeg) |
1977 |
DBLP DOI BibTeX RDF |
checkpoint-restart, database, recovery, storage management |
32 | Lei Wang 0042, Jiyuan Liu 0007, Qiang He 0001 |
Concept Drift-Based Checkpoint-Restart for Edge Services Rejuvenation. |
IEEE Trans. Serv. Comput. ![In: IEEE Trans. Serv. Comput. 16(3), pp. 1713-1725, May - June 2023. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
32 | Akira Nukada, Taichiro Suzuki, Satoshi Matsuoka |
Efficient checkpoint/Restart of CUDA applications. |
Parallel Comput. ![In: Parallel Comput. 116, pp. 103018, 2023. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
32 | Basma Abdel Azeem, Manal Helal |
Performance Evaluation of Checkpoint/Restart Techniques. |
CoRR ![In: CoRR abs/2311.17545, 2023. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
32 | Yao Xu, Leonid Belyaev, Twinkle Jain, Derek Schafer, Anthony Skjellum, Gene Cooperman |
Implementation-Oblivious Transparent Checkpoint-Restart for MPI. |
CoRR ![In: CoRR abs/2309.14996, 2023. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
32 | Yao Xu, Leonid Belyaev, Twinkle Jain, Derek Schafer, Anthony Skjellum, Gene Cooperman |
Implementation-Oblivious Transparent Checkpoint-Restart for MPI. |
SC Workshops ![In: Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis, SC-W 2023, Denver, CO, USA, November 12-17, 2023, pp. 1738-1747, 2023, ACM. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
32 | Niklas Eiling, Stefan Lankes, Antonello Monti |
Checkpoint/Restart for CUDA Kernels. |
SC Workshops ![In: Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis, SC-W 2023, Denver, CO, USA, November 12-17, 2023, pp. 1728-1737, 2023, ACM. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
32 | Niklas Eiling, Jonas Baude, Stefan Lankes, Antonello Monti |
Cricket: A virtualization layer for distributed execution of CUDA applications with checkpoint/restart support. |
Concurr. Comput. Pract. Exp. ![In: Concurr. Comput. Pract. Exp. 34(14), 2022. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
32 | Genlang Chen, Jiajian Zhang, Zufang Zhu, Hao Wang, Hai Jiang 0003, Chaoyi Pang |
CRAC: An automatic assistant compiler of checkpoint/restart for OpenCL program. |
Concurr. Comput. Pract. Exp. ![In: Concurr. Comput. Pract. Exp. 34(8), 2022. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
32 | Genlang Chen, Jiajian Zhang, Zufang Zhu, Qiangqiang Jiang, Hai Jiang 0003, Chaoyi Pang |
CRState: checkpoint/restart of OpenCL program for in-kernel applications. |
J. Supercomput. ![In: J. Supercomput. 77(6), pp. 5426-5467, 2021. The full citation details ...](Pics/full.jpeg) |
2021 |
DBLP DOI BibTeX RDF |
|
32 | Guangye Chen, Luis Chacón, Truong B. Nguyen |
An unsupervised machine-learning checkpoint-restart algorithm using Gaussian mixtures for particle-in-cell simulations. |
J. Comput. Phys. ![In: J. Comput. Phys. 436, pp. 110185, 2021. The full citation details ...](Pics/full.jpeg) |
2021 |
DBLP DOI BibTeX RDF |
|
32 | Anthony Skjellum, Derek Schafer |
Checkpoint-Restart Libraries Must Become More Fault Tolerant. |
CoRR ![In: CoRR abs/2112.10814, 2021. The full citation details ...](Pics/full.jpeg) |
2021 |
DBLP BibTeX RDF |
|
32 | Guangye Chen, Luis Chacón, Truong B. Nguyen |
An unsupervised machine-learning checkpoint-restart algorithm using Gaussian mixtures for particle-in-cell simulations. |
CoRR ![In: CoRR abs/2105.13797, 2021. The full citation details ...](Pics/full.jpeg) |
2021 |
DBLP BibTeX RDF |
|
32 | Prashant Singh Chouhan, Gregory Price, Gene Cooperman |
An Architecture for Exploiting Native User-Land Checkpoint-Restart to Improve Fuzzing. |
CoRR ![In: CoRR abs/2112.10100, 2021. The full citation details ...](Pics/full.jpeg) |
2021 |
DBLP BibTeX RDF |
|
32 | Rajeev Jain, Klaus Weide, Saurabh Chawdhary, Thomas Klostermann |
Checkpoint/Restart for Lagrangian particle mesh with AMR in community code FLASH-X. |
CoRR ![In: CoRR abs/2103.04267, 2021. The full citation details ...](Pics/full.jpeg) |
2021 |
DBLP BibTeX RDF |
|
32 | Kfir Zvi, Gal Oren 0001 |
Optimized Memoryless Fair-Share HPC Resources Scheduling using Transparent Checkpoint-Restart Preemption. |
CoRR ![In: CoRR abs/2102.12953, 2021. The full citation details ...](Pics/full.jpeg) |
2021 |
DBLP BibTeX RDF |
|
32 | Masoud Gholami, Florian Schintke |
Combining XOR and Partner Checkpointing for Resilient Multilevel Checkpoint/Restart. |
IPDPS ![In: 35th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2021, Portland, OR, USA, May 17-21, 2021, pp. 277-288, 2021, IEEE, 978-1-6654-4066-0. The full citation details ...](Pics/full.jpeg) |
2021 |
DBLP DOI BibTeX RDF |
|
32 | Shashank Gugnani, Tianxi Li, Xiaoyi Lu |
NVMe-CR: A Scalable Ephemeral Storage Runtime for Checkpoint/Restart with NVMe-over-Fabrics. |
IPDPS ![In: 35th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2021, Portland, OR, USA, May 17-21, 2021, pp. 172-181, 2021, IEEE, 978-1-6654-4066-0. The full citation details ...](Pics/full.jpeg) |
2021 |
DBLP DOI BibTeX RDF |
|
32 | Konstantinos Parasyris, Giorgis Georgakoudis, Leonardo Bautista-Gomez, Ignacio Laguna |
Co-Designing Multi-Level Checkpoint Restart for MPI Applications. |
CCGRID ![In: 21st IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing, CCGrid 2021, Melbourne, Australia, May 10-13, 2021, pp. 103-112, 2021, IEEE, 978-1-7281-9586-5. The full citation details ...](Pics/full.jpeg) |
2021 |
DBLP DOI BibTeX RDF |
|
32 | Twinkle Jain, Gene Cooperman |
CRAC: Checkpoint-Restart Architecture for CUDA with Streams and UVM. |
CoRR ![In: CoRR abs/2008.10596, 2020. The full citation details ...](Pics/full.jpeg) |
2020 |
DBLP BibTeX RDF |
|
32 | Tonmoy Dey, Kento Sato, Bogdan Nicolae, Jian Guo, Jens Domke, Weikuan Yu, Franck Cappello, Kathryn M. Mohror |
Optimizing Asynchronous Multi-Level Checkpoint/Restart Configurations with Machine Learning. |
IPDPS Workshops ![In: 2020 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2020, New Orleans, LA, USA, May 18-22, 2020, pp. 1036-1043, 2020, IEEE, 978-1-7281-7445-7. The full citation details ...](Pics/full.jpeg) |
2020 |
DBLP DOI BibTeX RDF |
|
32 | Konstantinos Parasyris, Kai Keller, Leonardo Bautista-Gomez, Osman S. Unsal |
Checkpoint Restart Support for Heterogeneous HPC Applications. |
CCGRID ![In: 20th IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing, CCGRID 2020, Melbourne, Australia, May 11-14, 2020, pp. 242-251, 2020, IEEE, 978-1-7281-6095-5. The full citation details ...](Pics/full.jpeg) |
2020 |
DBLP DOI BibTeX RDF |
|
32 | Twinkle Jain, Gene Cooperman |
CRAC: checkpoint-restart architecture for CUDA with streams and UVM. |
SC ![In: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2020, Virtual Event / Atlanta, Georgia, USA, November 9-19, 2020, pp. 77, 2020, IEEE/ACM, 978-1-7281-9998-6. The full citation details ...](Pics/full.jpeg) |
2020 |
DBLP DOI BibTeX RDF |
|
32 | Arif Ahmed 0001, Apoorve Mohan, Gene Cooperman, Guillaume Pierre |
Docker Container Deployment in Distributed Fog Infrastructures with Checkpoint/Restart. |
MobileCloud ![In: 8th IEEE International Conference on Mobile Cloud Computing, Services, and Engineering, MobileCloud 2020, Oxford, United Kingdom, August 3-6, 2020, pp. 55-62, 2020, IEEE, 978-1-7281-1035-6. The full citation details ...](Pics/full.jpeg) |
2020 |
DBLP DOI BibTeX RDF |
|
32 | Marina Morán, Javier Aldo Balladini, Dolores Rexachs, Emilio Luque |
Prediction of Energy Consumption by Checkpoint/Restart in HPC. |
IEEE Access ![In: IEEE Access 7, pp. 71791-71803, 2019. The full citation details ...](Pics/full.jpeg) |
2019 |
DBLP DOI BibTeX RDF |
|
32 | Manuel Aurelio Rodriguez Pascual, Jiajun Cao, José A. Moríñigo, Gene Cooperman, Rafael Mayo-García |
Job migration in HPC clusters by means of checkpoint/restart. |
J. Supercomput. ![In: J. Supercomput. 75(10), pp. 6517-6541, 2019. The full citation details ...](Pics/full.jpeg) |
2019 |
DBLP DOI BibTeX RDF |
|
32 | Faisal Shahzad 0001, Jonas Thies, Moritz Kreutzer, Thomas Zeiser, Georg Hager, Gerhard Wellein |
CRAFT: A Library for Easier Application-Level Checkpoint/Restart and Automatic Fault Tolerance. |
IEEE Trans. Parallel Distributed Syst. ![In: IEEE Trans. Parallel Distributed Syst. 30(3), pp. 501-514, 2019. The full citation details ...](Pics/full.jpeg) |
2019 |
DBLP DOI BibTeX RDF |
|
32 | Julien Adam, Maxime Kermarquer, Jean-Baptiste Besnard, Leonardo Bautista-Gomez, Marc Pérache, Patrick Carribault, Julien Jaeger, Allen D. Malony, Sameer Shende |
Checkpoint/restart approaches for a thread-based MPI runtime. |
Parallel Comput. ![In: Parallel Comput. 85, pp. 204-219, 2019. The full citation details ...](Pics/full.jpeg) |
2019 |
DBLP DOI BibTeX RDF |
|
32 | Julien Adam, Maxime Kermarquer, Jean-Baptiste Besnard, Leonardo Bautista-Gomez, Marc Pérache, Patrick Carribault, Julien Jaeger, Allen D. Malony, Sameer Shende |
Checkpoint/restart approaches for a thread-based MPI runtime. |
CoRR ![In: CoRR abs/1906.05020, 2019. The full citation details ...](Pics/full.jpeg) |
2019 |
DBLP BibTeX RDF |
|
32 | Jumpol Yaothanee, Kasidit Chanchio |
An In-Memory Checkpoint-Restart Mechanism for a Cluster of Virtual Machines. |
JCSSE ![In: 16th International Joint Conference on Computer Science and Software Engineering, JCSSE 2019, Chonburi, Thailand, July 10-12, 2019, pp. 131-136, 2019, IEEE, 978-1-7281-0719-6. The full citation details ...](Pics/full.jpeg) |
2019 |
DBLP DOI BibTeX RDF |
|
32 | Genlang Chen, Jiajian Zhang, Zufang Zhu, Chaoyan Zhu, Hai Jiang 0003, Chaoyi Pang |
CRAC: An Automatic Assistant Compiler of Checkpoint/Restart for OpenCL Program. |
ICDS ![In: Data Science - 6th International Conference, ICDS 2019, Ningbo, China, May 15-20, 2019, Revised Selected Papers, pp. 574-586, 2019, Springer, 978-981-15-2809-5. The full citation details ...](Pics/full.jpeg) |
2019 |
DBLP DOI BibTeX RDF |
|
32 | Genlang Chen, Jiajian Zhang, Qiuru Lin, Hai Jiang 0003, Chaoyi Pang |
CRState: In-Kernel Checkpoint/Restart of OpenCL Program Execution on GPU. |
ICPADS ![In: 25th IEEE International Conference on Parallel and Distributed Systems, ICPADS 2019, Tianjin, China, December 4-6, 2019, pp. 335-342, 2019, IEEE, 978-1-7281-2583-1. The full citation details ...](Pics/full.jpeg) |
2019 |
DBLP DOI BibTeX RDF |
|
32 | Jialing Zhang, Xiaoyan Zhuo, Aekyeung Moon, Hang Liu, Seung Woo Son 0001 |
Efficient Encoding and Reconstruction of HPC Datasets for Checkpoint/Restart. |
MSST ![In: 35th Symposium on Mass Storage Systems and Technologies, MSST 2019, Santa Clara, CA, USA, May 20-24, 2019, pp. 79-91, 2019, IEEE, 978-1-7281-3920-3. The full citation details ...](Pics/full.jpeg) |
2019 |
DBLP DOI BibTeX RDF |
|
32 | Masoud Gholami Estahbanati, Florian Schintke |
Multilevel Checkpoint/Restart for Large Computational Jobs on Distributed Computing Resources. |
SRDS ![In: 38th Symposium on Reliable Distributed Systems, SRDS 2019, Lyon, France, October 1-4, 2019, pp. 143-152, 2019, IEEE, 978-1-7281-4222-7. The full citation details ...](Pics/full.jpeg) |
2019 |
DBLP DOI BibTeX RDF |
|
32 | Thouraya Louati, Heithem Abbes, Christophe Cérin, Mohamed Jemni |
LXCloud-CR: Towards LinuX Containers Distributed Hash Table based Checkpoint-Restart. |
J. Parallel Distributed Comput. ![In: J. Parallel Distributed Comput. 111, pp. 187-205, 2018. The full citation details ...](Pics/full.jpeg) |
2018 |
DBLP DOI BibTeX RDF |
|
32 | Gregory Michael Price |
DMTCP Checkpoint/Restart of MPI Programs via Proxies. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/1803.09342, 2018. The full citation details ...](Pics/full.jpeg) |
2018 |
DBLP BibTeX RDF |
|
32 | Rohan Garg 0001, Apoorve Mohan, Michael B. Sullivan 0001, Gene Cooperman |
CRUM: Checkpoint-Restart Support for CUDA's Unified Memory. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/1808.00117, 2018. The full citation details ...](Pics/full.jpeg) |
2018 |
DBLP BibTeX RDF |
|
32 | Adrian Bazaga, Michal Pitonák |
Performance Evaluation of an Algorithm-based Asynchronous Checkpoint-Restart Fault Tolerant Application Using Mixed MPI/GPI-2. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/1804.11312, 2018. The full citation details ...](Pics/full.jpeg) |
2018 |
DBLP BibTeX RDF |
|
32 | Julien Adam, Jean-Baptiste Besnard, Allen D. Malony, Sameer Shende, Marc Pérache, Patrick Carribault, Julien Jaeger |
Transparent High-Speed Network Checkpoint/Restart in MPI. ![Search on Bibsonomy](Pics/bibsonomy.png) |
EuroMPI ![In: Proceedings of the 25th European MPI Users' Group Meeting, Barcelona, Spain, September 23-26, 2018, pp. 12:1-12:11, 2018, ACM. The full citation details ...](Pics/full.jpeg) |
2018 |
DBLP DOI BibTeX RDF |
|
32 | Chayawat Pechwises, Kasidit Chanchio |
A Transparent Hypervisor-level Checkpoint-Restart Mechanism for a Cluster of Virtual Machines. ![Search on Bibsonomy](Pics/bibsonomy.png) |
JCSSE ![In: 15th International Joint Conference on Computer Science and Software Engineering, JCSSE 2018, Nakhonpathom, Thailand, July 11-13, 2018, pp. 1-6, 2018, IEEE, 978-1-5386-5538-2. The full citation details ...](Pics/full.jpeg) |
2018 |
DBLP DOI BibTeX RDF |
|
32 | Rohan Garg 0001, Apoorve Mohan, Michael B. Sullivan 0001, Gene Cooperman |
CRUM: Checkpoint-Restart Support for CUDA's Unified Memory. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CLUSTER ![In: IEEE International Conference on Cluster Computing, CLUSTER 2018, Belfast, UK, September 10-13, 2018, pp. 302-313, 2018, IEEE Computer Society, 978-1-5386-8319-4. The full citation details ...](Pics/full.jpeg) |
2018 |
DBLP DOI BibTeX RDF |
|
32 | Faisal Shahzad 0001, Jonas Thies, Moritz Kreutzer, Thomas Zeiser, Georg Hager, Gerhard Wellein |
CRAFT: A library for easier application-level Checkpoint/Restart and Automatic Fault Tolerance. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/1708.02030, 2017. The full citation details ...](Pics/full.jpeg) |
2017 |
DBLP BibTeX RDF |
|
32 | Kiril Dichev, Herbert Jordan, Konstantinos Tovletoglou, Thomas Heller, Dimitrios S. Nikolopoulos, Georgios Karakonstantis, Charles Gillan |
Dependency-Aware Rollback and Checkpoint-Restart for Distributed Task-Based Runtimes. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/1705.10208, 2017. The full citation details ...](Pics/full.jpeg) |
2017 |
DBLP BibTeX RDF |
|
32 | Marcos Maronas, Sergi Mateo, Vicenç Beltran 0001, Eduard Ayguadé |
A Directive-Based Approach to Perform Persistent Checkpoint/Restart. ![Search on Bibsonomy](Pics/bibsonomy.png) |
HPCS ![In: 2017 International Conference on High Performance Computing & Simulation, HPCS 2017, Genoa, Italy, July 17-21, 2017, pp. 442-451, 2017, IEEE, 978-1-5386-3249-9. The full citation details ...](Pics/full.jpeg) |
2017 |
DBLP DOI BibTeX RDF |
|
32 | Abhinav Agrawal 0003, Gabriel H. Loh, James Tuck 0001 |
Leveraging near data processing for high-performance checkpoint/restart. ![Search on Bibsonomy](Pics/bibsonomy.png) |
SC ![In: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2017, Denver, CO, USA, November 12 - 17, 2017, pp. 60, 2017, ACM, 978-1-4503-5114-0. The full citation details ...](Pics/full.jpeg) |
2017 |
DBLP DOI BibTeX RDF |
|
32 | Behnam Pourghassemi, Aparna Chandramowlishwaran |
cudaCR: An In-Kernel Application-Level Checkpoint/Restart Scheme for CUDA-Enabled GPUs. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CLUSTER ![In: 2017 IEEE International Conference on Cluster Computing, CLUSTER 2017, Honolulu, HI, USA, September 5-8, 2017, pp. 725-732, 2017, IEEE Computer Society, 978-1-5386-2326-8. The full citation details ...](Pics/full.jpeg) |
2017 |
DBLP DOI BibTeX RDF |
|
32 | Juri Schmidt |
Accelerating checkpoint/restart application performance in large-scale systems with network attached memory. ![Search on Bibsonomy](Pics/bibsonomy.png) |
|
2017 |
RDF |
|
32 | Behnam Pourghassemi |
cudaCR: An In-kernel Application-level Checkpoint/Restart Scheme for CUDA Applications. ![Search on Bibsonomy](Pics/bibsonomy.png) |
|
2017 |
RDF |
|
32 | Jiajun Cao, Kapil Arya, Rohan Garg 0001, L. Shawn Matott, Dhabaleswar K. Panda 0001, Hari Subramoni, Jérôme Vienne, Gene Cooperman |
System-level Scalable Checkpoint-Restart for Petascale Computing. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/1607.07995, 2016. The full citation details ...](Pics/full.jpeg) |
2016 |
DBLP BibTeX RDF |
|
32 | Zhengyu Chen, Jianhua Sun 0002, Hao Chen 0002 |
Optimizing Checkpoint Restart with Data Deduplication. ![Search on Bibsonomy](Pics/bibsonomy.png) |
Sci. Program. ![In: Sci. Program. 2016, pp. 9315493:1-9315493:11, 2016. The full citation details ...](Pics/full.jpeg) |
2016 |
DBLP DOI BibTeX RDF |
|
32 | Jiajun Cao, Kapil Arya, Rohan Garg 0001, L. Shawn Matott, Dhabaleswar K. Panda 0001, Hari Subramoni, Jérôme Vienne, Gene Cooperman |
System-Level Scalable Checkpoint-Restart for Petascale Computing. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICPADS ![In: 22nd IEEE International Conference on Parallel and Distributed Systems, ICPADS 2016, Wuhan, China, December 13-16, 2016, pp. 932-941, 2016, IEEE Computer Society, 978-1-5090-4457-3. The full citation details ...](Pics/full.jpeg) |
2016 |
DBLP DOI BibTeX RDF |
|
32 | Scott Levy, Kurt B. Ferreira |
An Examination of the Impact of Failure Distribution on Coordinated Checkpoint/Restart. ![Search on Bibsonomy](Pics/bibsonomy.png) |
FTXS@HPDC ![In: Proceedings of the ACM Workshop on Fault-Tolerance for HPC at Extreme Scale, FTXS@HPDC 2016, Kyoto, Japan, May 31, 2016, pp. 35-42, 2016, ACM, 978-1-4503-4349-7. The full citation details ...](Pics/full.jpeg) |
2016 |
DBLP DOI BibTeX RDF |
|
32 | Javier Arias Moreno, Osman S. Unsal, Jesús Labarta, Adrián Cristal |
NanoCheckpoints: A Task-Based Asynchronous Dataflow Framework for Efficient and Scalable Checkpoint/Restart. ![Search on Bibsonomy](Pics/bibsonomy.png) |
PDP ![In: 23rd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing, PDP 2015, Turku, Finland, March 4-6, 2015, pp. 99-102, 2015, IEEE Computer Society, 978-1-4799-8491-6. The full citation details ...](Pics/full.jpeg) |
2015 |
DBLP DOI BibTeX RDF |
|
32 | Naoto Sasaki, Kento Sato, Toshio Endo, Satoshi Matsuoka |
Exploration of Lossy Compression for Application-Level Checkpoint/Restart. ![Search on Bibsonomy](Pics/bibsonomy.png) |
IPDPS ![In: 2015 IEEE International Parallel and Distributed Processing Symposium, IPDPS 2015, Hyderabad, India, May 25-29, 2015, pp. 914-922, 2015, IEEE Computer Society, 978-1-4799-8649-1. The full citation details ...](Pics/full.jpeg) |
2015 |
DBLP DOI BibTeX RDF |
|
32 | Jiajun Cao, Gregory Kerr, Kapil Arya, Gene Cooperman |
Transparent checkpoint-restart over infiniband. ![Search on Bibsonomy](Pics/bibsonomy.png) |
HPDC ![In: The 23rd International Symposium on High-Performance Parallel and Distributed Computing, HPDC'14, Vancouver, BC, Canada - June 23 - 27, 2014, pp. 13-24, 2014, ACM, 978-1-4503-2749-7. The full citation details ...](Pics/full.jpeg) |
2014 |
DBLP DOI BibTeX RDF |
|
32 | Ajay Saini, Arash Rezaei, Frank Mueller 0001, Paul Hargrove, Eric Roman |
Affinity-aware checkpoint restart. ![Search on Bibsonomy](Pics/bibsonomy.png) |
Middleware ![In: Proceedings of the 15th International Middleware Conference, Bordeaux, France, December 8-12, 2014, pp. 121-132, 2014, ACM, 978-1-4503-2785-5. The full citation details ...](Pics/full.jpeg) |
2014 |
DBLP DOI BibTeX RDF |
|
32 | Cristiano Giuffrida, Calin Iorgulescu, Andrew S. Tanenbaum |
Mutable checkpoint-restart: automating live update for generic server programs. ![Search on Bibsonomy](Pics/bibsonomy.png) |
Middleware ![In: Proceedings of the 15th International Middleware Conference, Bordeaux, France, December 8-12, 2014, pp. 133-144, 2014, ACM, 978-1-4503-2785-5. The full citation details ...](Pics/full.jpeg) |
2014 |
DBLP DOI BibTeX RDF |
|
32 | Nosayba El-Sayed, Bianca Schroeder |
Checkpoint/restart in practice: When 'simple is better'. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CLUSTER ![In: 2014 IEEE International Conference on Cluster Computing, CLUSTER 2014, Madrid, Spain, September 22-26, 2014, pp. 84-92, 2014, IEEE Computer Society, 978-1-4799-5548-0. The full citation details ...](Pics/full.jpeg) |
2014 |
DBLP DOI BibTeX RDF |
|
32 | Faisal Shahzad 0001, Markus Wittmann, Moritz Kreutzer, Thomas Zeiser, Georg Hager, Gerhard Wellein |
A Survey of Checkpoint/Restart Techniques on Distributed Memory Systems. ![Search on Bibsonomy](Pics/bibsonomy.png) |
Parallel Process. Lett. ![In: Parallel Process. Lett. 23(4), 2013. The full citation details ...](Pics/full.jpeg) |
2013 |
DBLP DOI BibTeX RDF |
|
32 | Ifeanyi P. Egwutuoha, David Levy 0001, Bran Selic, Shiping Chen 0001 |
A survey of fault tolerance mechanisms and checkpoint/restart implementations for high performance computing systems. ![Search on Bibsonomy](Pics/bibsonomy.png) |
J. Supercomput. ![In: J. Supercomput. 65(3), pp. 1302-1326, 2013. The full citation details ...](Pics/full.jpeg) |
2013 |
DBLP DOI BibTeX RDF |
|
32 | Bogdan Nicolae, Franck Cappello |
BlobCR: Virtual disk based checkpoint-restart for HPC applications on IaaS clouds. ![Search on Bibsonomy](Pics/bibsonomy.png) |
J. Parallel Distributed Comput. ![In: J. Parallel Distributed Comput. 73(5), pp. 698-711, 2013. The full citation details ...](Pics/full.jpeg) |
2013 |
DBLP DOI BibTeX RDF |
|
32 | Jiajun Cao, Kapil Arya, Gene Cooperman |
Transparent Checkpoint-Restart over InfiniBand. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/1312.3938, 2013. The full citation details ...](Pics/full.jpeg) |
2013 |
DBLP BibTeX RDF |
|
32 | Kapil Arya, Gene Cooperman, Andrea Dotti, Peter Elmer |
Use of Checkpoint-Restart for Complex HEP Software on Traditional Architectures and Intel MIC. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/1311.0272, 2013. The full citation details ...](Pics/full.jpeg) |
2013 |
DBLP BibTeX RDF |
|
32 | Samaneh Kazemi, Rohan Garg 0001, Gene Cooperman |
Transparent Checkpoint-Restart for Hardware-Accelerated 3D Graphics. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/1312.6650, 2013. The full citation details ...](Pics/full.jpeg) |
2013 |
DBLP BibTeX RDF |
|
|