|
|
Venues (Conferences, Journals, ...)
|
|
GrowBag graphs for keyword ? (Num. hits/coverage)
Group by:
The graphs summarize 2154 occurrences of 1058 keywords
|
|
|
Results
Found 4206 publication records. Showing 4206 according to the selection in the facets
Hits ?▲ |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
16 | Jing Han, Wei Zhang 0049, Huiling Shi, Yan Zhou, Chang Tang, Jingshan Pan |
Accelerate Supercomputing through Cross-Region Interconnection. |
SmartWorld/UIC/ScalCom/DigitalTwin/PriComp/Meta |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Gabriele Cavallaro, Morris Riedel, Thomas Lippert, Kristel Michielsen |
Hybrid Quantum-Classical Workflows in Modular Supercomputing Architectures with the Julich Unified Infrastructure for Quantum Computing. |
IGARSS |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Sarah Neuwirth |
Assessment of the I/O and Storage Subsystem in Modular Supercomputing Architectures. |
CLUSTER |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Susumu Yamada, Toshiyuki Imamura, Masahiko Machida |
High Performance Parallel LOBPCG Method for Large Hamiltonian Derived from Hubbard Model on Multi-GPU Systems. |
SCFA |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Atsushi Hori, Kaiming Ouyang, Balazs Gerofi, Yutaka Ishikawa |
On the Difference Between Shared Memory and Shared Address Space in HPC Communication. |
SCFA |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Pengyu Wang, Zhong Chen |
Vapor Condensation Under Electric Field: A Study Using Molecular Dynamics Simulation. |
SCFA |
2022 |
DBLP DOI BibTeX RDF |
|
16 | George S. Markomanolis, Aksel Alpay, Jeff Young 0001, Michael Klemm, Nicholas Malaya, Aniello Esposito, Jussi Heikonen, Sergey I. Bastrakov, Alexander Debus, Thomas Kluge, Klaus Steiniger, Jan Stephan, René Widera, Michael Bussmann |
Evaluating GPU Programming Models for the LUMI Supercomputer. |
SCFA |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Abdullah Bittar, Ziqiang Wang, Amir Aghasharif, Changcheng Huang, Gauravdeep Shami, Marc Lyonnais, Rodney Wilson |
Service Function Chaining Design & Implementation Using Network Service Mesh in Kubernetes. |
SCFA |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Jakub Kopec |
Evaluating Methods of Transferring Large Datasets. |
SCFA |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Shao-Hen Chiew, Leong-Chuan Kwek, Chee-Kong Lee |
Exploring the Dynamics of Quantum Information in Many-Body Localised Systems with High Performance Computing. |
SCFA |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Jie Yao, K. S. Yeo |
The Effect of Wing Mass and Wing Elevation Motion During Insect Forward Flight. |
SCFA |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Nicolas Grenèche, Christophe Cérin |
Autoscaling of Containerized HPC Clusters in the Cloud. |
SuperCompCloud |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Simon Caton, Matt Baughman, Christian Haas 0003, Ryan Chard, Ian T. Foster, Kyle Chard |
Assessing the Current State of AWS Spot Market Forecastability. |
SuperCompCloud |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Hossein Golestani, Rathijit Sen, Vinson Young, Gagan Gupta |
Calipers: a criticality-aware framework for modeling processor performance. |
ICS |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Minh Pham, Hao Li 0071, Yongke Yuan, Chengcheng Mou, Kandethody Ramachandran, Zichen Xu 0001, Yicheng Tu |
Dynamic memory management in massively parallel systems: a case on GPUs. |
ICS |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Cheng Tan 0002, Thierry Tambe, Jeff Jun Zhang, Bo Fang, Tong Geng, Gu-Yeon Wei, David Brooks 0001, Antonino Tumeo, Ganesh Gopalakrishnan, Ang Li 0006 |
ASAP: automatic synthesis of area-efficient and precision-aware CGRAs. |
ICS |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Taha Shahroodi, Mahdi Zahedi, Abhairaj Singh, Stephan Wong, Said Hamdioui |
KrakenOnMem: a memristor-augmented HW/SW framework for taxonomic profiling. |
ICS |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Ardhi Wiratama Baskara Yudha, Jake Meyer, Shougang Yuan, Huiyang Zhou, Yan Solihin |
LITE: a low-cost practical inter-operable GPU TEE. |
ICS |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Zixuan Ma, Haojie Wang, Guanyu Feng, Chen Zhang, Lei Xie, Jiaao He, Shengqi Chen 0001, Jidong Zhai |
Efficiently emulating high-bitwidth computation with low-bitwidth hardware. |
ICS |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Chengming Zhang 0006, Sian Jin, Tong Geng, Jiannan Tian, Ang Li 0006, Dingwen Tao |
CEAZ: accelerating parallel I/O via hardware-algorithm co-designed adaptive lossy compression. |
ICS |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Heng Zhang 0005, Lingda Li, Hang Liu 0001, Donglin Zhuang, Rui Liu 0002, Chengying Huan, Shuang Song, Dingwen Tao, Yongchao Liu, Charles He, Yanjun Wu, Shuaiwen Leon Song |
Bring orders into uncertainty: enabling efficient uncertain graph processing via novel path sampling on multi-accelerator systems. |
ICS |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Oliver Rausch, Tal Ben-Nun, Nikoli Dryden, Andrei Ivanov, Shigang Li 0002, Torsten Hoefler |
A data-centric optimization framework for machine learning. |
ICS |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Wesley Smith, Aidan Goldfarb, Chen Ding 0001 |
Beyond time complexity: data movement complexity analysis for matrix multiplication. |
ICS |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Jonathon M. Anderson, Yumeng Liu, John M. Mellor-Crummey |
Preparing for performance analysis at exascale. |
ICS |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Keren Zhou 0001, Jonathon M. Anderson, Xiaozhu Meng, John M. Mellor-Crummey |
Low overhead and context sensitive profiling of CPU-accelerated applications. |
ICS |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Alexandru Calotoiu, Tal Ben-Nun, Grzegorz Kwasniewski, Johannes de Fine Licht, Timo Schneider, Philipp Schaad, Torsten Hoefler |
Lifting C semantics for dataflow optimization. |
ICS |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Jiangsu Du, Jiazhi Jiang, Yang You 0001, Dan Huang, Yutong Lu |
Handling heavy-tailed input of transformer inference on GPUs. |
ICS |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Arthur Francisco Lorenzon, Sandro Matheus V. N. Marques, Antoni C. Navarro, Vicenç Beltran 0001 |
Seamless optimization of the GEMM kernel for task-based programming models. |
ICS |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Kamalakkannan Kamalavasan, Gihan R. Mudalige, István Z. Reguly, Suhaib A. Fahmy |
High throughput multidimensional tridiagonal system solvers on FPGAs. |
ICS |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Mohsen Koohi Esfahani, Peter Kilpatrick, Hans Vandierendonck |
MASTIFF: structure-aware minimum spanning tree/forest. |
ICS |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Adhitha Dias, Kirshanthan Sundararajah, Charitha Saumya, Milind Kulkarni 0001 |
SparseLNR: accelerating sparse tensor computations using loop nest restructuring. |
ICS |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Sharjeel Khan, Bodhisatwa Chatterjee, Santosh Pande |
VICO: demand-driven verification for improving compiler optimizations. |
ICS |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Andreas Abel 0002, Jan Reineke 0001 |
uiCA: accurate throughput prediction of basic blocks on recent intel microarchitectures. |
ICS |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Khalid Ayedh Alharthi, Arshad Jhumka, Sheng Di, Franck Cappello |
Clairvoyant: a log-based transformer-decoder for failure prediction in large-scale systems. |
ICS |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Mohammad Almasri, Izzat El Hajj, Rakesh Nagi, Jinjun Xiong, Wen-Mei Hwu |
Parallel K-clique counting on GPUs. |
ICS |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Daeyoung Park, Heehoon Kim, Jinpyo Kim, Taehyun Kim 0002, Jaejin Lee |
SnuQS: scaling quantum circuit simulation using storage devices. |
ICS |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Serif Yesil, José E. Moreira, Josep Torrellas |
Dense dynamic blocks: optimizing SpMM for processors with vector and matrix units using machine learning techniques. |
ICS |
2022 |
DBLP DOI BibTeX RDF |
|
16 | André Müller, Bertil Schmidt, Richard Membarth, Roland Leißa, Sebastian Hack |
AnySeq/GPU: a novel approach for faster sequence alignment on GPUs. |
ICS |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Zhongzhe Hu, Junmin Xiao, Zheye Deng, Mingyi Li, Kewei Zhang, Xiaoyang Zhang, Ke Meng, Ninghui Sun, Guangming Tan |
MegTaiChi: dynamic tensor-based memory management optimization for DNN training. |
ICS |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Xiaoyan Liu, Yi Liu 0013, Hailong Yang, Jianjin Liao, Mingzhen Li, Zhongzhi Luan, Depei Qian |
Toward accelerated stencil computation by adapting tensor core unit on GPU. |
ICS |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Mingzhe Liu, Haikun Liu, Chencheng Ye, Xiaofei Liao, Hai Jin 0001, Yu Zhang 0027, Ran Zheng, Liting Hu |
Towards low-latency I/O services for mixed workloads using ultra-low latency SSDs. |
ICS |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Jinpyo Kim, Hyungdal Kwon, Jintaek Kang, Jihwan Park, Seungwook Lee, Jaejin Lee |
SnuHPL: high performance LINPACK for heterogeneous GPUs. |
ICS |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Shulai Zhang, Weihao Cui, Quan Chen 0002, Zhengnian Zhang, Yue Guan, Jingwen Leng, Chao Li 0009, Minyi Guo |
PAME: precision-aware multi-exit DNN serving for reducing latencies of batched inferences. |
ICS |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Zhuoran Ji, Cho-Li Wang |
Efficient exact K-nearest neighbor graph construction for billion-scale datasets using GPUs with tensor cores. |
ICS |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Hugo Tárrega, Alejandro Valero, Vicente Lorente, Salvador Petit, Julio Sahuquillo |
Fast-track cache: a huge racetrack memory L1 data cache. |
ICS |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Guangnan Feng, Dezun Dong, Yutong Lu |
Optimized MPI collective algorithms for dragonfly topology. |
ICS |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Bagus Hanindhito, Dimitrios Gourounas, Arash Fathi, Dimitar Trenev, Andreas Gerstlauer, Lizy K. John |
GAPS: GPU-acceleration of PDE solvers for wave simulation. |
ICS |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Larissa Schmid, Marcin Copik, Alexandru Calotoiu, Dominik Werle, Andreas Reiter, Michael Selzer, Anne Koziolek, Torsten Hoefler |
Performance-detective: automatic deduction of cheap and accurate performance models. |
ICS |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Andy Nguyen, Ahmed E. Helal, Fabio Checconi, Jan Laukemann, Jesmin Jahan Tithi, Yongseok Soh, Teresa M. Ranadive, Fabrizio Petrini, Jee W. Choi |
Efficient, out-of-memory sparse MTTKRP on massively parallel architectures. |
ICS |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Apostolos Kokolis, Namrata Mantri, Shrikanth Ganapathy, Josep Torrellas, John Kalamatianos |
Cloak: tolerating non-volatile cache read latency. |
ICS |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Hans Vandierendonck |
Software-defined floating-point number formats and their application to graph processing. |
ICS |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Shihui Song, Peng Jiang 0004 |
Rethinking graph data placement for graph neural network training on multiple GPUs. |
ICS |
2022 |
DBLP DOI BibTeX RDF |
|
16 | Álvaro Fernández 0004, Camino Fernández 0001, José Ángel Miguel-Dávila, Miguel Á. Conde |
Integrating supercomputing clusters into education: a case study in biotechnology. |
J. Supercomput. |
2021 |
DBLP DOI BibTeX RDF |
|
16 | |
High-performance in classification of heart disease using advanced supercomputing technique with cluster-based enhanced deep genetic algorithm. |
J. Supercomput. |
2021 |
DBLP DOI BibTeX RDF |
|
16 | Sinan G. Aksoy, Paul J. Bruillard, Stephen J. Young, Mark Raugas |
Ramanujan graphs and the spectral gap of supercomputing topologies. |
J. Supercomput. |
2021 |
DBLP DOI BibTeX RDF |
|
16 | David A. Bader |
Linux and Supercomputing: How My Passion for Building COTS Systems Led to an HPC Revolution. |
IEEE Ann. Hist. Comput. |
2021 |
DBLP DOI BibTeX RDF |
|
16 | Rollin C. Thomas, Shreyas Cholia, Kathryn M. Mohror, John M. Shalf |
Interactive Supercomputing With Jupyter. |
Comput. Sci. Eng. |
2021 |
DBLP DOI BibTeX RDF |
|
16 | Michael Bauer 0001, Wonchan Lee, Manolis Papadakis, Marcin Zalewski, Michael Garland, Konrad Hinsen, Anshu Dubey |
Supercomputing in Python With Legate. |
Comput. Sci. Eng. |
2021 |
DBLP DOI BibTeX RDF |
|
16 | Josh Vincent Vermaas, Ada Sedova, Matthew B. Baker, Swen Boehm, David M. Rogers, Jeff Larkin, Jens Glaser, Micholas Dean Smith, Oscar R. Hernandez, Jeremy C. Smith |
Supercomputing Pipelines Search for Therapeutics Against COVID-19. |
Comput. Sci. Eng. |
2021 |
DBLP DOI BibTeX RDF |
|
16 | Jim Samuel, Margaret Brennan-Tonetta, Yana Samuel, Pradeep Subedi, Jack Smith |
Strategies for Democratization of Supercomputing: Availability, Accessibility and Usability of High Performance Computing for Education and Practice of Big Data Analytics. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
16 | Bryce A. Primavera, Jeffrey M. Shainline |
Considerations for neuromorphic supercomputing in semiconducting and superconducting optoelectronic hardware. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
16 | Kaira Samuel, Jeremy Kepner, Michael Jones 0001, Lauren Milechin, Vijay Gadepally, William Arcand, David Bestor, William Bergeron, Chansup Byun, Matthew Hubbell, Michael Houle 0001, Anna Klein, Victor Lopez, Julie Mullen, Andrew Prout, Albert Reuther, Antonio Rosa, Sid Samsi, Charles Yee, Peter Michaleas |
Supercomputing Enabled Deployable Analytics for Disaster Response. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
16 | Giovanni Agosta, Daniele Cattaneo 0002, William Fornaciari, Andrea Galimberti, Giuseppe Massari, Federico Reghenzani, Federico Terraneo, Davide Zoni, Carlo Brandolese, Massimo Celino, Francesco Iannone, Paolo Palazzari, Giuseppe Zummo, Massimo Bernaschi, Pasqua D'Ambra, Sergio Saponara, Marco Danelutto, Massimo Torquati, Marco Aldinucci, Yasir Arfat, Barbara Cantalupo, Iacopo Colonnelli, Roberto Esposito, Alberto Riccardo Martinelli, Gianluca Mittone, Olivier Beaumont, Bérenger Bramas, Lionel Eyraud-Dubois, Brice Goglin, Abdou Guermouche, Raymond Namyst, Samuel Thibault, Antonio Filgueras, Miquel Vidal, Carlos Álvarez 0001, Xavier Martorell, Ariel Oleksiak, Michal Kulczewski, Alessandro Lonardo, Piero Vicini, Francesca Lo Cicero, Francesco Simula, Andrea Biagioni, Paolo Cretaro, Ottorino Frezza, Pier Stanislao Paolucci, Matteo Turisini, Francesco Giacomini, Tommaso Boccali, Simone Montangero, Roberto Ammendola |
TEXTAROSSA: Towards EXtreme scale Technologies and Accelerators for euROhpc hw/Sw Supercomputing Applications for exascale. |
DSD |
2021 |
DBLP DOI BibTeX RDF |
|
16 | Gilad Shainer, Richard L. Graham, Chris J. Newburn, Oscar R. Hernandez, Gil Bloch, Tom Gibbs, Jack C. Wells |
NVIDIA's Cloud Native Supercomputing. |
SMC |
2021 |
DBLP DOI BibTeX RDF |
|
16 | Morris Riedel, Rocco Sedona, Chadi Barakat, Pétur Helgi Einarsson, Reza Hassanian, Gabriele Cavallaro, Matthias Book, Helmut Neukirchen, Andreas Lintermann |
Practice and Experience in using Parallel and Scalable Machine Learning with Heterogenous Modular Supercomputing Architectures. |
IPDPS Workshops |
2021 |
DBLP DOI BibTeX RDF |
|
16 | Huiyang Zhou, Jose Moreira, Frank Mueller 0001, Yoav Etsion (eds.) |
ICS '21: 2021 International Conference on Supercomputing, Virtual Event, USA, June 14-17, 2021. |
ICS |
2021 |
DBLP DOI BibTeX RDF |
|
16 | Kaira Samuel, Jeremy Kepner, Michael Jones 0001, Lauren Milechin, Vijay Gadepally, William Arcand, David Bestor, William Bergeron, Chansup Byun, Matthew Hubbell, Michael Houle 0001, Anna Klein, Victor Lopez, Julie Mullen, Andrew Prout, Albert Reuther, Antonio Rosa, Sid Samsi, Charles Yee, Peter Michaleas |
Supercomputing Enabled Deployable Analytics for Disaster Response. |
HPEC |
2021 |
DBLP DOI BibTeX RDF |
|
16 | Alexandros Nikolaos Ziogas, Tal Ben-Nun, Timo Schneider, Torsten Hoefler |
NPBench: a benchmarking suite for high-performance NumPy. |
ICS |
2021 |
DBLP DOI BibTeX RDF |
|
16 | Akshay Bhosale, Rudolf Eigenmann |
On the automatic parallelization of subscripted subscript patterns using array property analysis. |
ICS |
2021 |
DBLP DOI BibTeX RDF |
|
16 | Archit Patke, Saurabh Jha, Haoran Qiu, Jim M. Brandt, Ann C. Gentile, Joe Greenseid, Zbigniew Kalbarczyk, Ravishankar K. Iyer |
Delay sensitivity-driven congestion mitigation for HPC systems. |
ICS |
2021 |
DBLP DOI BibTeX RDF |
|
16 | Brandon Neth, Thomas R. W. Scogland, Bronis R. de Supinski, Michelle Mills Strout |
Inter-loop optimization in RAJA using loop chains. |
ICS |
2021 |
DBLP DOI BibTeX RDF |
|
16 | Xiaodong Yu 0001, Tekin Bicer, Rajkumar Kettimuthu, Ian T. Foster |
Topology-aware optimizations for multi-GPU ptychographic image reconstruction. |
ICS |
2021 |
DBLP DOI BibTeX RDF |
|
16 | Markos Kynigos, Jose Antonio Pascual, Javier Navaridas, John Goodacre, Mikel Luján |
Power and energy efficient routing for Mach-Zehnder interferometer based photonic switches. |
ICS |
2021 |
DBLP DOI BibTeX RDF |
|
16 | Yujia Zhai, Elisabeth Giem, Quan Fan, Kai Zhao 0008, Jinyang Liu, Zizhong Chen |
FT-BLAS: a high performance BLAS implementation with online fault tolerance. |
ICS |
2021 |
DBLP DOI BibTeX RDF |
|
16 | Mazen Al-Wadi, Aziz Mohaisen, Amro Awad |
ProMT: optimizing integrity tree updates for write-intensive pages in secure NVMs. |
ICS |
2021 |
DBLP DOI BibTeX RDF |
|
16 | Nader Al Awar, Steven Zhu, George Biros, Milos Gligoric 0001 |
A performance portability framework for Python. |
ICS |
2021 |
DBLP DOI BibTeX RDF |
|
16 | Thomas Randall, Tyler N. Allen, Rong Ge 0002 |
FULL-W2V: fully exploiting data reuse for W2V on GPU-accelerated systems. |
ICS |
2021 |
DBLP DOI BibTeX RDF |
|
16 | Jie Ren 0015, Jiaolin Luo, Ivy Bo Peng, Kai Wu 0006, Dong Li 0001 |
Optimizing large-scale plasma simulations on persistent memory-based heterogeneous memory with effective data placement across memory hierarchy. |
ICS |
2021 |
DBLP DOI BibTeX RDF |
|
16 | MohammadHossein Olyaiy, Christopher Ng, Mieszko Lis |
Accelerating DNNs inference with predictive layer fusion. |
ICS |
2021 |
DBLP DOI BibTeX RDF |
|
16 | Kumudha Narasimhan, Aravind Acharya, Abhinav Baid, Uday Bondhugula |
A practical tile size selection model for affine loop nests. |
ICS |
2021 |
DBLP DOI BibTeX RDF |
|
16 | Xin He, Jiawen Liu, Zhen Xie, Hao Chen 0002, Guoyang Chen, Weifeng Zhang 0003, Dong Li 0001 |
Enabling energy-efficient DNN training on hybrid GPU-FPGA accelerators. |
ICS |
2021 |
DBLP DOI BibTeX RDF |
|
16 | Yuliana Zamora, Logan T. Ward, Ganesh Sivaraman, Ian T. Foster, Henry Hoffmann |
Proxima: accelerating the integration of machine learning in atomistic simulations. |
ICS |
2021 |
DBLP DOI BibTeX RDF |
|
16 | Ming Dun, Yunchun Li, Hailong Yang, Qingxiao Sun, Zhongzhi Luan, Depei Qian |
An optimized tensor completion library for multiple GPUs. |
ICS |
2021 |
DBLP DOI BibTeX RDF |
|
16 | Yaoyang Zhou, Zihao Yu, Chuanqi Zhang, Yinan Xu 0001, Huizhe Wang, Sa Wang, Ninghui Sun, Yungang Bao |
Omegaflow: a high-performance dependency-based architecture. |
ICS |
2021 |
DBLP DOI BibTeX RDF |
|
16 | Siling Yang, Weijian Chen 0002, Xuechen Zhang 0001, Shuibing He, Yanlong Yin, Xian-He Sun |
AUTO-PRUNE: automated DNN pruning and mapping for ReRAM-based accelerator. |
ICS |
2021 |
DBLP DOI BibTeX RDF |
|
16 | Chengming Zhang 0006, Geng Yuan, Wei Niu 0002, Jiannan Tian, Sian Jin, Donglin Zhuang, Zhe Jiang 0001, Yanzhi Wang, Bin Ren, Shuaiwen Leon Song, Dingwen Tao |
ClickTrain: efficient and accurate end-to-end deep learning training via fine-grained architecture-preserving pruning. |
ICS |
2021 |
DBLP DOI BibTeX RDF |
|
16 | Xuan Huang, Pavol Klacansky, Steve Petruzza, Attila Gyulassy, Peer-Timo Bremer, Valerio Pascucci |
Distributed merge forest: a new fast and scalable approach for topological analysis at scale. |
ICS |
2021 |
DBLP DOI BibTeX RDF |
|
16 | Xin Zhao, Jin Zhou, Hui Guan 0001, Wei Wang 0054, Xu Liu 0001, Tongping Liu |
NumaPerf: predictive NUMA profiling. |
ICS |
2021 |
DBLP DOI BibTeX RDF |
|
16 | Zhen Xie, Wenqian Dong, Jie Liu, Ivy Bo Peng, Yanbao Ma, Dong Li 0001 |
MD-HM: memoization-based molecular dynamics simulations on big memory system. |
ICS |
2021 |
DBLP DOI BibTeX RDF |
|
16 | Ahmed E. Helal, Jan Laukemann, Fabio Checconi, Jesmin Jahan Tithi, Teresa M. Ranadive, Fabrizio Petrini, Jeewhan Choi |
ALTO: adaptive linearized storage of sparse tensors. |
ICS |
2021 |
DBLP DOI BibTeX RDF |
|
16 | Rohan Baskar Prabhakar, Sachit Kuhar, Rohit Agrawal 0001, Christopher J. Hughes, Christopher W. Fletcher |
SumMerge: an efficient algorithm and implementation for weight repetition-aware DNN inference. |
ICS |
2021 |
DBLP DOI BibTeX RDF |
|
16 | Xuhao Chen 0001, Roshan Dathathri, Gurbinder Gill, Loc Hoang, Keshav Pingali |
Sandslash: a two-level framework for efficient graph pattern mining. |
ICS |
2021 |
DBLP DOI BibTeX RDF |
|
16 | Shougang Yuan, Yan Solihin, Huiyang Zhou |
PSSM: achieving secure memory for GPUs with partitioned and sectored security metadata. |
ICS |
2021 |
DBLP DOI BibTeX RDF |
|
16 | Gunduz Vehbi Demirci, Hakan Ferhatosmanoglu |
Partitioning sparse deep neural networks for scalable training and inference. |
ICS |
2021 |
DBLP DOI BibTeX RDF |
|
16 | Wenwen Wang 0001, Pei-Hung Lin |
Does it matter?: OMPSanitizer: an impact analyzer of reported data races in OpenMP programs. |
ICS |
2021 |
DBLP DOI BibTeX RDF |
|
16 | Peng Chen 0035, Mohamed Wahib, Xiao Wang 0004, Shin'ichiro Takizawa, Takahiro Hirofuchi, Hirotaka Ogawa, Satoshi Matsuoka |
Performance portable back-projection algorithms on CPUs: agnostic data locality and vectorization optimizations. |
ICS |
2021 |
DBLP DOI BibTeX RDF |
|
16 | Doru-Thom Popovici, Andrew Canning, Zhengji Zhao, Lin-Wang Wang, John Shalf |
A systematic approach to improving data locality across Fourier transforms and linear algebra operations. |
ICS |
2021 |
DBLP DOI BibTeX RDF |
|
16 | Chen Zhang, Zeyu Song, Haojie Wang, Kaiyuan Rong, Jidong Zhai |
HyQuas: hybrid partitioner based quantum circuit simulation system on GPU. |
ICS |
2021 |
DBLP DOI BibTeX RDF |
|
16 | Amirhossein Mirhosseini, Thomas F. Wenisch |
μSteal: a theory-backed framework for preemptive work and resource stealing in mixed-criticality microservices. |
ICS |
2021 |
DBLP DOI BibTeX RDF |
|
16 | Adrián Barredo, Adrià Armejach, Jonathan C. Beard, Miquel Moretó |
PLANAR: a programmable accelerator for near-memory data rearrangement. |
ICS |
2021 |
DBLP DOI BibTeX RDF |
|
|
|