Searching for Allreduce with no syntactic query expansion in all metadata.

Publication years (Num. hits)
2003-2008 (15) 2010-2019 (18) 2020-2021 (15) 2022-2023 (15) 2024 (5)
Publication types (Num. hits)
article(23) inproceedings(45)
Venues (Conferences, Journals, ...)
CoRR(10) CCGrid(4) CLUSTER(3) SC(3) APNet(2) Clust. Comput.(2) Hot Interconnects(2) ICPADS(2) ICPP(2) ICS(2) IPDPS(2) Parallel Comput.(2) PPoPP(2) PVM/MPI(2) CANDAR Workshops(1) Computing(1) More (+10 of total 42)

Results
Found 68 publication records. Showing 68 according to the selection in the facets
Hits | Authors | Title | Venue | Year | Author keywords
89 | Amith R. Mamidala, Jiuxing Liu, Dhabaleswar K. Panda 0001 | Efficient Barrier and Allreduce on Infiniband clusters using multicast and adaptive algorithms. | CLUSTER | 2004 |
84 | Rinku Gupta, Pavan Balaji, Dhabaleswar K. Panda 0001, Jarek Nieplocha | Efficient Collective Operations Using Remote Memory Operations on VIA-Based Clusters. | IPDPS | 2003 |
74 | Motohiko Matsuda, Tomohiro Kudoh, Yuetsu Kodama, Ryousei Takano, Yutaka Ishikawa | The design and implementation of MPI collective operations for clusters in long-and-fast networks. | Clust. Comput. | 2008 | Allreduce, Grid, Broadcast, Message passing interface (MPI), Wide-area network, Collective communication
63 | Lars Ailo Bongo, Otto J. Anshus, John Markus Bjørndalen, Tore Larsen | Extending Collective Operations with Application Semantics for Improving Multi-Cluster Performance. | ISPDC/HeteroPar | 2004 |
63 | Lars Ailo Bongo, Otto J. Anshus, John Markus Bjørndalen | Collective Communication Performance Analysis Within the Communication System. | Euro-Par | 2004 |
52 | Peng Liu, Jintao Peng, Jie Liu, Lihua Chi | TH-Allreduce: Optimizing Small Data Allreduce Operation on Tianhe System. | ICPADS | 2023 |
47 | George Almási 0001, Gábor Dózsa, C. Christopher Erway, Burkhard D. Steinmacher-Burow | Efficient Implementation of Allreduce on BlueGene/L Collective Network. | PVM/MPI | 2005 |
42 | Espen Skjelnes Johnsen, John Markus Bjørndalen, Otto J. Anshus | CoMPI- Configuration of Collective Operations in LAM/MPI Using the Scheme Programming Language. | PARA | 2006 |
42 | Motohiko Matsuda, Tomohiro Kudoh, Yuetsu Kodama, Ryousei Takano, Yutaka Ishikawa | Efficient MPI Collective Operations for Clusters in Long-and-Fast Networks. | CLUSTER | 2006 |
33 | Keith D. Underwood, Jerrie Coffman, Roy Larsen, K. Scott Hemmert, Brian W. Barrett, Ron Brightwell, Michael J. Levenhagen | Enabling Flexible Collective Communication Offload with Triggered Operations. | Hot Interconnects | 2011 | Allreduce, MPI, collective, offload
26 | Emin Nuriyev, Ravi Reddy Manumachu, Samar Aseeri, Mahendra K. Verma, Alexey L. Lastovetsky | SUARA: A scalable universal allreduce communication algorithm for acceleration of parallel deep learning applications. | J. Parallel Distributed Comput. | 2024 |
26 | Daniele De Sensi, Tommaso Bonato, David Saam, Torsten Hoefler | Swing: Short-cutting Rings for Higher Bandwidth Allreduce. | CoRR | 2024 |
26 | Daniele De Sensi, Edgar Costa Molero, Salvatore Di Girolamo, Laurent Vanbever, Torsten Hoefler | Canary: Congestion-aware in-network allreduce using dynamic trees. | Future Gener. Comput. Syst. | 2024 |
26 | Guozheng Wang, Yongmei Lei, Zeyu Zhang, Cunlu Peng | 2D-THA-ADMM: communication efficient distributed ADMM algorithm framework based on two-dimensional torus hierarchical AllReduce. | Int. J. Mach. Learn. Cybern. | 2024 |
26 | Daniele De Sensi, Tommaso Bonato, David Saam, Torsten Hoefler | Swing: Short-cutting Rings for Higher Bandwidth Allreduce. | NSDI | 2024 |
26 | Guozheng Wang, Yongmei Lei, Zeyu Zhang, Cunlu Peng | A Communication Efficient ADMM-based Distributed Algorithm Using Two-Dimensional Torus Grouping AllReduce. | Data Sci. Eng. | 2023 |
26 | Ertza Warraich, Omer Shabtai, Khalid Manaa, Shay Vargaftik, Yonatan Piasetzky, Matty Kadosh, Lalith Suresh, Muhammad Shahbaz 0001 | Ultima: Robust and Tail-Optimal AllReduce for Distributed Deep Learning in the Cloud. | CoRR | 2023 |
26 | Daniele De Sensi, Edgar Costa Molero, Salvatore Di Girolamo, Laurent Vanbever, Torsten Hoefler | Canary: Congestion-Aware In-Network Allreduce Using Dynamic Trees. | CoRR | 2023 |
26 | Adrián Castelló 0001, Mar Catalán, Manuel F. Dolz, Enrique S. Quintana-Ortí, José Duato | Analyzing the impact of the MPI allreduce in distributed training of convolutional neural networks. | Computing | 2023 |
26 | Ruiqi Wang, Dezun Dong, Fei Lei, Junchao Ma, Ke Wu, Kai Lu | Roar: A Router Microarchitecture for In-network Allreduce. | ICS | 2023 |
26 | Peng Liu, Jintao Peng, Jie Liu 0002, Min Xie, Liuhua Chi | GLEX_Allreduce: Optimization for medium and small message of Allreduce on Tianhe system. | ICPADS | 2023 |
26 | Chang Chen, Min Li, Chao Yang | bbTopk: Bandwidth-Aware Sparse Allreduce with Blocked Sparsification for Efficient Distributed Training. | ICDCS | 2023 |
26 | Marcin Chrapek, Mikhail Khalilov, Torsten Hoefler | HEAR: Homomorphically Encrypted Allreduce. | SC | 2023 |
26 | Kartik Lakhotia, Kelly Isham, Laura Monroe, Maciej Besta, Torsten Hoefler, Fabrizio Petrini | In-network Allreduce with Multiple Spanning Trees on PolarFly. | SPAA | 2023 |
26 | Kartik Lakhotia, Fabrizio Petrini, Rajgopal Kannan, Viktor K. Prasanna | Accelerating Allreduce With In-Network Reduction on Intel PIUMA. | IEEE Micro | 2022 |
26 | Shigang Li 0002, Torsten Hoefler | Near-Optimal Sparse Allreduce for Distributed Deep Learning. | CoRR | 2022 |
26 | Sam White, Laxmikant V. Kalé | Optimizing Non-commutative Allreduce Over Virtualized, Migratable MPI Ranks. | IPDPS Workshops | 2022 |
26 | Zeyu Zhang, Yongmei Lei, Dongxia Wang, Guozheng Wang | Distributed ADMM Based on Sparse Computation and Allreduce Communication. | ISPA/BDCloud/SocialCom/SustainCom | 2022 |
26 | Shigang Li 0002, Torsten Hoefler | Near-optimal sparse allreduce for distributed deep learning. | PPoPP | 2022 |
26 | Truong Thao Nguyen, Mohamed Wahib, Ryousei Takano | Efficient MPI-AllReduce for large-scale deep learning on GPU-clusters. | Concurr. Comput. Pract. Exp. | 2021 |
26 | Dongxia Wang 0003, Yongmei Lei, Jinyang Xie, Guozheng Wang | HSAC-ALADMM: an asynchronous lazy ADMM algorithm based on hierarchical sparse allreduce communication. | J. Supercomput. | 2021 |
26 | Yao Liu 0006, Junyi Zhang 0005, Shuo Liu 0002, Qiaoling Wang, Wangchen Dai, Ray Chak-Chung Cheung | Scalable Fully Pipelined Hardware Architecture for In-Network Aggregated AllReduce Communication. | IEEE Trans. Circuits Syst. I Regul. Pap. | 2021 |
26 | Andreas Jocksch, Noé Ohana, Emmanuel Lanti, Eirini Koutsaniti, Vasileios Karakasis, Laurent Villard | An optimisation of allreduce communication in message-passing systems. | Parallel Comput. | 2021 |
26 | Daniele De Sensi, Salvatore Di Girolamo, Saleh Ashkboos, Shigang Li 0002, Torsten Hoefler | Flare: Flexible In-Network Allreduce. | CoRR | 2021 |
26 | Adrián Castelló 0001, Enrique S. Quintana-Ortí, José Duato | Accelerating distributed deep neural network training with pipelined MPI allreduce. | Clust. Comput. | 2021 |
26 | Ido Hakimi, Rotem Zamir Aviv, Kfir Y. Levy, Assaf Schuster | LAGA: Lagged AllReduce with Gradient Accumulation for Minimal Idle Time. | ICDM | 2021 |
26 | Adrián Castelló 0001, Mar Catalán, Manuel F. Dolz, José I. Mestre, Enrique S. Quintana-Ortí, José Duato | Evaluation of MPI Allreduce for Distributed Training of Convolutional Neural Networks. | PDP | 2021 |
26 | Truong Thao Nguyen, Mohamed Wahib | An Allreduce Algorithm and Network Co-design for Large-Scale Training of Distributed Deep Learning. | CCGRID | 2021 |
26 | Akira Nukada | Performance Optimization of Allreduce Operation for Multi-GPU Systems. | IEEE BigData | 2021 |
26 | Daniele De Sensi, Salvatore Di Girolamo, Saleh Ashkboos, Shigang Li 0002, Torsten Hoefler | Flare: flexible in-network allreduce. | SC | 2021 |
26 | Jiayu Wang, Peng Liu, Zehua Guo 0001, Sen Liu 0002, Chao Yao | Exploring the Impact of Attacks on Ring AllReduce. | APNet | 2021 |
26 | Andreas Jocksch, Noé Ohana, Emmanuel Lanti, Vasileios Karakasis, Laurent Villard | Optimised allgatherv, reduce_scatter and allreduce communication in message-passing systems. | CoRR | 2020 |
26 | Dmitry Kolmakov, Xuecang Zhang | A Generalization of the Allreduce Operation. | CoRR | 2020 |
26 | Xinchen Wan, Hong Zhang 0025, Hao Wang 0116, Shuihai Hu, Junxue Zhang 0001, Kai Chen 0005 | RAT - Resilient Allreduce Tree for Distributed Machine Learning. | APNet | 2020 |
26 | Zehua Cheng, Zhenghua Xu | Bandwidth Reduction using Importance Weighted Pruning on Ring AllReduce. | CoRR | 2019 |
26 | Amanda Bienz, Luke N. Olson, William D. Gropp | Node-Aware Improvements to Allreduce. | CoRR | 2019 |
26 | Truong Thao Nguyen, Mohamed Wahib, Ryousei Takano | Topology-aware Sparse Allreduce for Large-scale Deep Learning. | IPCCC | 2019 |
26 | Yuichiro Ueno, Rio Yokota | Exhaustive Study of Hierarchical AllReduce Patterns for Large Messages Between GPUs. | CCGRID | 2019 |
26 | Truong Thao Nguyen, Mohamed Wahib, Ryousei Takano | Hierarchical Distributed-Memory Multi-Leader MPI-Allreduce for Deep Learning Workloads. | CANDAR Workshops | 2018 |
26 | Martin Ruefenacht, Mark Bull, Stephen Booth | Generalisation of recursive doubling for AllReduce: Now with simulation. | Parallel Comput. | 2017 |
26 | Jesús M. Álvarez Llorente, Juan Carlos Díaz Martín, Juan A. Rico-Gallego | Formal modeling and performance evaluation of a run-time rank remapping technique in Broadcast, Allgather and Allreduce MPI collective operations. | CCGrid | 2017 |
26 | Martin Ruefenacht, Mark Bull, Stephen Booth | Generalisation of Recursive Doubling for AllReduce. | EuroMPI | 2016 |
26 | Patrick M. Widener, Kurt B. Ferreira, Scott Levy, Torsten Hoefler | Exploring the effect of noise on the performance benefit of nonblocking allreduce. | EuroMPI/ASIA | 2014 |
26 | Keichi Takahashi, Dashdavaa Khureltulga, Yasuhiro Watashiba, Yoshiyuki Kido, Susumu Date, Shinji Shimojo | Performance evaluation of SDN-enhanced MPI allreduce on a cluster system with fat-tree interconnect. | HPCS | 2014 |
26 | Lena Oden, Benjamin Klenk, Holger Fröning | Energy-Efficient Collective Reduce and Allreduce Operations on Distributed GPUs. | CCGRID | 2014 |
26 | Huasha Zhao, John F. Canny | Kylix: A Sparse Allreduce for Commodity Clusters. | ICPP | 2014 |
26 | Huasha Zhao, John F. Canny | Sparse Allreduce: Efficient Scalable Communication for Power-Law Data. | CoRR | 2013 |
26 | Nongda Hu, Dawei Wang, Zheng Cao, Xuejun An, Ninghui Sun | Accelerating Allreduce Operation: A Switch-Based Solution. | ICCCN | 2013 |
26 | Krishna Chaitanya Kandalla, Akshay Venkatesh, Khaled Hamidouche, Sreeram Potluri, Devendar Bureddy, Dhabaleswar K. Panda 0001 | Designing Optimized MPI Broadcast and Allreduce for Many Integrated Core (MIC) InfiniBand Clusters. | Hot Interconnects | 2013 |
26 | Krishna Chaitanya Kandalla, Ulrike Meier Yang, Jeff Keasler, Tzanio V. Kolev, Adam Moody, Hari Subramoni, Karen Tomko, Jérôme Vienne, Bronis R. de Supinski, Dhabaleswar K. Panda 0001 | Designing Non-blocking Allreduce with Collective Offload on InfiniBand Clusters: A Case Study with Conjugate Gradient Solvers. | IPDPS | 2012 |
26 | Toshiyuki Imamura | Recursive multi-factoring algorithm for MPI allreduce. | Parallel and Distributed Computing and Networks | 2007 |
21 | Nikhil Jain, Yogish Sabharwal | Optimal bucket algorithms for large MPI collectives on torus interconnects. | ICS | 2010 | communication, MPI, collective, torus network
21 | Sameer Kumar 0001, Gábor Dózsa, Jeremy Berg, Bob Cernohous, Douglas Miller, Joe Ratterman, Brian E. Smith, Philip Heidelberger | Architecture of the Component Collective Messaging Interface. | PVM/MPI | 2008 |
21 | Sundeep Narravula, Amith R. Mamidala, Abhinav Vishnu, Gopalakrishnan Santhanaraman, Dhabaleswar K. Panda 0001 | High Performance MPI over iWARP: Early Experiences. | ICPP | 2007 |
21 | José Carlos Sancho, Darren J. Kerbyson, Kevin J. Barker | Efficient offloading of collective communications in large-scale systems. | CLUSTER | 2007 |
21 | Trammell Hudson, Ron Brightwell | Poster reception - Network performance impact of a lightweight Linux for Cray XT3 compute nodes. | SC | 2006 |
21 | Ernie Chan, Robert A. van de Geijn, William Gropp, Rajeev Thakur | Collective communication on architectures that support simultaneous communication over multiple links. | PPoPP | 2006 |
21 | Dongyoung Kim, Dongseung Kim | Enhanced Collective Communication Functions Using Factorization and Pairwise-exchange Communication. | ICPADS (1) | 2005 |
Maintained by L3S.
Previously maintained by Jörg Diederich.
Based upon DBLP by Michael Ley.
Data released under the ODC-BY 1.0 license.