Searching for GPUs with no syntactic query expansion in all metadata.

Publication years (Num. hits)
2003-2004 (33) 2005 (50) 2006 (78) 2007 (89) 2008 (135) 2009 (195) 2010 (163) 2011 (200) 2012 (206) 2013 (242) 2014 (310) 2015 (242) 2016 (260) 2017 (257) 2018 (291) 2019 (288) 2020 (275) 2021 (282) 2022 (305) 2023 (266) 2024 (57)
Publication types (Num. hits)
article(1550) book(2) incollection(31) inproceedings(2570) phdthesis(66) proceedings(5)
GrowBag keyword graphs (not reproduced here) summarize 894 occurrences of 448 keywords.

Results
Found 4224 publication records. Showing 4224 according to the selection in the facets.
Hits Authors Title Venue Year Link Author keywords
10Jeongmin Hong, Sungjun Cho, Geonwoo Park, Wonhyuk Yang, Young-Ho Gong, Gwangsun Kim Bandwidth-Effective DRAM Cache for GPUs with Storage-Class Memory. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
10Herbert Owen, Dominik Ernst, Thomas Gruber, Oriol Lehmkuhl, Guillaume Houzeaux, Lucas Gasparino, Gerhard Wellein Alya towards Exascale: Optimal OpenACC Performance of the Navier-Stokes Finite Element Assembly on GPUs. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
10Ziheng Jiang, Haibin Lin, Yinmin Zhong, Qi Huang, Yangrui Chen, Zhi Zhang, Yanghua Peng, Xiang Li, Cong Xie, Shibiao Nong, Yulu Jia, Sun He, Hongmin Chen, Zhihao Bai, Qi Hou, Shipeng Yan, Ding Zhou, Yiyao Sheng, Zhuo Jiang, Haohan Xu, Haoran Wei, Zhang Zhang, Pengfei Nie, Leqi Zou, Sida Zhao, Liang Xiang, Zherui Liu, Zhe Li, Xiaoying Jia 0001, Jianxi Ye, Xin Jin, Xin Liu MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
10Luchang Li, Sheng Qian, Jie Lu, Lunxi Yuan, Rui Wang, Qin Xie Transformer-Lite: High-efficiency Deployment of Large Language Models on Mobile Phone GPUs. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
10John R. Tramm, Paul K. Romano, Patrick C. Shriwise, Amanda Lund, Johannes Doerfert, Patrick Steinbrecher, Andrew R. Siegel, Gavin Ridley Performance Portable Monte Carlo Particle Transport on Intel, NVIDIA, and AMD GPUs. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
10Abhinav Jangda, Mohit Yadav Fast Kronecker Matrix-Matrix Multiplication on GPUs. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
10Kai Yuan, Christoph Bauinger, Xiangyi Zhang, Pascal Baehr, Matthias Kirchhart, Darius Dabert, Adrien Tousnakhoff, Pierre Boudier, Michael Paulitsch Fully-fused Multi-Layer Perceptrons on Intel Data Center GPUs. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
10Frederik Dermot Pustelnik, Xhani Marvin Saß, Jean-Pierre Seifert Whispering Pixels: Exploiting Uninitialized Register Accesses in Modern GPUs. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
10Gobikrishna Dhanuskodi, Sudeshna Guha, Vidhya Krishnan, Aruna Manjunatha, Rob Nertney, Michael O'Connor, Phil Rogers Creating the First Confidential GPUs. Search on Bibsonomy Commun. ACM The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
10Yutian Chen, Cong Peng 0005, Yu Dai, Min Luo 0002, Debiao He Load-Balanced Parallel Implementation on GPUs for Multi-Scalar Multiplication Algorithm. Search on Bibsonomy IACR Trans. Cryptogr. Hardw. Embed. Syst. The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
10Xudong Zhu, Haoqi He, Zhengbang Yang, Yi Deng, Lutan Zhao, Rui Hou 0001 Elastic MSM: A Fast, Elastic and Modular Preprocessing Technique for Multi-Scalar Multiplication Algorithm on GPUs. Search on Bibsonomy IACR Cryptol. ePrint Arch. The full citation details ... 2024 DBLP  BibTeX  RDF
10Wen-Hsiang Chou, Cheng-Han Wu, Shih-Chun Jin, Jyh-Cheng Chen Iterative Reconstruction of Micro Computed Tomography Scans Using Multiple Heterogeneous GPUs. Search on Bibsonomy Sensors The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
10Hongru Gao, Xiaofei Liao, Zhiyuan Shao, Kexin Li, Jiajie Chen, Hai Jin 0001 A survey on dynamic graph processing on GPUs: concepts, terminologies and systems. Search on Bibsonomy Frontiers Comput. Sci. The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
10Malith Jayaweera, Martin Kong, Yanzhi Wang, David R. Kaeli Energy-Aware Tile Size Selection for Affine Programs on GPUs. Search on Bibsonomy CGO The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
10Alnis Murtovi, Giorgis Georgakoudis, Konstantinos Parasyris, Chunhua Liao, Ignacio Laguna, Bernhard Steffen Enhancing Performance Through Control-Flow Unmerging and Loop Unrolling on GPUs. Search on Bibsonomy CGO The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
10Aditi Saha, Mohammad Rahman, Fan Wu 0013 Evaluating LSTM Time Series Prediction Performance on Benchmark CPUs and GPUs in Cloud Environments. Search on Bibsonomy ACM Southeast Regional Conference The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
10Tong Zhou, Jun Shirako, Vivek Sarkar APPy: Annotated Parallelism for Python on GPUs. Search on Bibsonomy CC The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
10Brian Homerding, Arturo Vargas, Tom Scogland, Robert Chen, Mike Davis, Rich Hornung Enabling RAJA on Intel GPUs with SYCL. Search on Bibsonomy IWOCL The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
10Ioanna-Maria Panagou, Nikolaos Bellas, Lorenzo Moneta, Sanjiban Sengupta Accelerating Machine Learning Inference on GPUs with SYCL. Search on Bibsonomy IWOCL The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
10Luigi Crisci, Lorenzo Carpentieri, Peter Thoman, Aksel Alpay, Vincent Heuveline, Biagio Cosenza SYCL-Bench 2020: Benchmarking SYCL 2020 on AMD, Intel, and NVIDIA GPUs. Search on Bibsonomy IWOCL The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
10Ziheng Jiang, Haibin Lin, Yinmin Zhong, Qi Huang, Yangrui Chen, Zhi Zhang, Yanghua Peng, Xiang Li, Cong Xie, Shibiao Nong, Yulu Jia, Sun He, Hongmin Chen, Zhihao Bai, Qi Hou, Shipeng Yan, Ding Zhou, Yiyao Sheng, Zhuo Jiang, Haohan Xu, Haoran Wei, Zhang Zhang, Pengfei Nie, Leqi Zou, Sida Zhao, Liang Xiang, Zherui Liu, Zhe Li, Xiaoying Jia 0001, Jianxi Ye, Xin Jin, Xin Liu MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs. Search on Bibsonomy NSDI The full citation details ... 2024 DBLP  BibTeX  RDF
10Hongwu Peng, Caiwen Ding, Tong Geng, Sutanay Choudhury, Kevin J. Barker, Ang Li 0006 Evaluating Emerging AI/ML Accelerators: IPU, RDU, and NVIDIA/AMD GPUs. Search on Bibsonomy ICPE (Companion) The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
10Ties Robroek, Ehsan Yousefzadeh-Asl-Miandoab, Pinar Tözün An Analysis of Collocation on GPUs for Deep Learning Training. Search on Bibsonomy EuroMLSys@EuroSys The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
10Connor Espenshade, Rachel Peng, Eumin Hong, Max Calman, Yue Zhu, Pritish Parida, Eun Kyung Lee, Martha A. Kim Characterizing Training Performance and Energy for Foundation Models and Image Classifiers on Multi-Instance GPUs. Search on Bibsonomy EuroMLSys@EuroSys The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
10Jiacheng Yang, Christina Giannoula, Jun Wu, Mostafa Elhoushi, James Gleeson 0001, Gennady Pekhimenko Minuet: Accelerating 3D Sparse Convolutions on GPUs. Search on Bibsonomy EuroSys The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
10Abhinav Jangda, Mohit Yadav Fast Kronecker Matrix-Matrix Multiplication on GPUs. Search on Bibsonomy PPoPP The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
10Meng Pang, Xiang Fei, Peng Qu, Youhui Zhang, Zhaolin Li A Row Decomposition-based Approach for Sparse Matrix Multiplication on GPUs. Search on Bibsonomy PPoPP The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
10Zhuoran Ji, Zhaorui Zhang, Jiming Xu, Lei Ju 0001 POSTER: Accelerating High-Precision Integer Multiplication used in Cryptosystems with GPUs. Search on Bibsonomy PPoPP The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
10Yifei Li, Bole Zhou, Jiejing Zhang, Xuechao Wei, Yinghan Li, Yingda Chen POSTER: RadiK: Scalable Radix Top-K Selection on GPUs. Search on Bibsonomy PPoPP The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
10Kentaro Katayama, Noboru Yoneoka, Kouichi Kanda, Hirotaka Tamura, Hiroshi Nakayama, Yasuhiro Watanabe Digital Annealing Engine for High-speed Solving of Constrained Binary Quadratic Problems on Multiple GPUs. Search on Bibsonomy ICCE The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
10Michael Davies, Ian McDougall, Selvaraj Anandaraj, Deep Machchhar, Rithik Jain, Karthikeyan Sankaralingam A Journey of a 1,000 Kernels Begins with a Single Step: A Retrospective of Deep Learning on GPUs. Search on Bibsonomy ASPLOS (2) The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
10Tianao Ge, Tong Zhang, Hongyuan Liu 0002 ngAP: Non-blocking Large-scale Automata Processing on GPUs. Search on Bibsonomy ASPLOS (1) The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
10Dwi P. A. Nugroho, Philipp M. Grulich, Steffen Zeuch, Clemens Lutz, Stefano Bortoli, Volker Markl Benchmarking Stream Join Algorithms on GPUs: A Framework and its Application to the State-of-the-art. Search on Bibsonomy EDBT The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
10Chonglin Zhang, Gerrett Diamond, Cameron W. Smith, Mark S. Shephard Development of an unstructured mesh gyrokinetic particle-in-cell code for exascale fusion plasma simulations on GPUs. Search on Bibsonomy Comput. Phys. Commun. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Hongyuan Liu 0002, Sreepathi Pai, Adwait Jog Asynchronous Automata Processing on GPUs. Search on Bibsonomy Proc. ACM Meas. Anal. Comput. Syst. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Xia Zhao 0004, Guangda Zhang, Lu Wang, Yangmei Li, Yongjun Zhang RouteReplies: Alleviating Long Latency in Many-Chip-Module GPUs. Search on Bibsonomy IEEE Comput. Archit. Lett. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Meng Wu, Mingyu Yan, Xiaocheng Yang, Wenming Li, Zhimin Zhang 0004, Xiaochun Ye, Dongrui Fan Characterizing and Understanding Defense Methods for GNNs on GPUs. Search on Bibsonomy IEEE Comput. Archit. Lett. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Pratyush Patel, Zibo Gong, Syeda Rizvi, Esha Choukse, Pulkit A. Misra, Thomas E. Anderson, Akshitha Sriraman Towards Improved Power Management in Cloud GPUs. Search on Bibsonomy IEEE Comput. Archit. Lett. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Manthan Verma, Soumyadeep Chatterjee, Gaurav Garg, Bharatkumar Sharma, Nishant Arya, Sashi Kumar, Anish Saxena, Mahendra K. Verma Scalable Multi-node Fast Fourier Transform on GPUs. Search on Bibsonomy SN Comput. Sci. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Gonzalo Berger, Manuel Freire 0002, Renzo Marini, Ernesto Dufrechou, Pablo Ezzatti Advancing on an efficient sparse matrix multiplication kernel for modern GPUs. Search on Bibsonomy Concurr. Comput. Pract. Exp. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Tomohiro Imanaga, Koji Nakano, Ryota Yasudo, Yasuaki Ito, Yuya Kawamata, Ryota Katsuki, Yusuke Tabata, Takashi Yazane, Kenichiro Hamano Simple iterative trial search for the maximum independent set problem optimized for the GPUs. Search on Bibsonomy Concurr. Comput. Pract. Exp. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Pietro Incardona, Aryaman Gupta, Serhii Yaskovets, Ivo F. Sbalzarini A portable C++ library for memory and compute abstraction on multi-core CPUs and GPUs. Search on Bibsonomy Concurr. Comput. Pract. Exp. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Yu-Hsiang Tsai, Terry Cojean, Hartwig Anzt Providing performance portable numerics for Intel GPUs. Search on Bibsonomy Concurr. Comput. Pract. Exp. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Mi Li, Eibe Frank, Bernhard Pfahringer Large scale K-means clustering using GPUs. Search on Bibsonomy Data Min. Knowl. Discov. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Neal Livesay, Gilbert Jonatan, Evelio Mora, Kaustubh Shivdikar, Rashmi Agrawal 0001, Ajay Joshi, José L. Abellán, John Kim, David R. Kaeli Accelerating Finite Field Arithmetic for Homomorphic Encryption on GPUs. Search on Bibsonomy IEEE Micro The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Shiqing Zhang, Ziyue Zhang, Mahmood Naderan-Tahan, Hossein SeyyedAghaei, Xin Wang, He Li, Senbiao Qin, Didier Colle, Guy Torfs, Mario Pickavet, Johan Bauwelinck, Günther Roelkens, Lieven Eeckhout Photonic Network-on-Wafer for Multichiplet GPUs. Search on Bibsonomy IEEE Micro The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Jakub Homola, Michal Merta, Jan Zapletal Acceleration of the space-time boundary element method using GPUs. Search on Bibsonomy Adv. Eng. Softw. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Shilong Wang, Hang Liu 0001, Anil Gaihre, Hengyong Yu ezLDA: Efficient and Scalable LDA on GPUs. Search on Bibsonomy IEEE Access The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Daegun Yoon, Minjoong Jeong, Sangyoon Oh 0001 WAVE: designing a heuristics-based three-way breadth-first search on GPUs. Search on Bibsonomy J. Supercomput. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Hojin Cho, Myungsun Kim gCFS: completely fair scheduling on multiple GPUs for improved multi-DNN execution in terms of performance isolation. Search on Bibsonomy J. Supercomput. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Taghreed Bagies, Wei Le, Jeremy Sheaffer, Ali Jannesari Reducing branch divergence to speed up parallel execution of unit testing on GPUs. Search on Bibsonomy J. Supercomput. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Yonghua Zhang, Hongxu Jiang, Yuting Zhu, Runhua Zhang, Yongxiang Cao, Chenhui Zhu, Wei Wang, Dong Dong, Xiaobin Li LOCP: Latency-optimized channel pruning for CNN inference acceleration on GPUs. Search on Bibsonomy J. Supercomput. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Daniel Di Domenico, João V. F. Lima, Gerson G. H. Cavalheiro NAS Parallel Benchmarks with Python: a performance and programming effort analysis focusing on GPUs. Search on Bibsonomy J. Supercomput. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Manuel de Castro, Inmaculada Santamaria-Valenzuela, Yuri Torres, Arturo González-Escribano, Diego R. Llanos EPSILOD: efficient parallel skeleton for generic iterative stencil computations in distributed GPUs. Search on Bibsonomy J. Supercomput. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Tejin Cai, Kenneth Herner, Tingjun Yang, Michael Wang 0003, Maria Acosta Flechas, Philip C. Harris, Burt Holzman, Kevin Pedro, Nhan Tran Accelerating Machine Learning Inference with GPUs in ProtoDUNE Data Processing. Search on Bibsonomy Comput. Softw. Big Sci. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Song Liu 0007, Jie Ma, Chenyu Zhao, Xinhe Wan, Weiguo Wu LFWS: Long-Operation First Warp Scheduling Algorithm to Effectively Hide the Latency for GPUs. Search on Bibsonomy IEICE Trans. Fundam. Electron. Commun. Comput. Sci. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Siham Boukhris, Artem Napov, Yvan Notay Algebraic Multigrid Using a Stencil-CSR Hybrid Format on GPUs. Search on Bibsonomy SIAM J. Sci. Comput. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Heajung Min, Kyung Min Han, Young J. Kim OctoMap-RT: Fast Probabilistic Volumetric Mapping Using Ray-Tracing GPUs. Search on Bibsonomy IEEE Robotics Autom. Lett. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Martin Karp, Daniele Massaro, Niclas Jansson, Alistair Hart, Jacob Wahlgren, Philipp Schlatter, Stefano Markidis Large-Scale direct numerical simulations of turbulence using GPUs and modern Fortran. Search on Bibsonomy Int. J. High Perform. Comput. Appl. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Yupeng Huang, Hong Zhang, Siyuan Jiang, Dajiong Yue, Xiaohan Lin, Jun Zhang, Yi Qin Gao DSDP: A Blind Docking Strategy Accelerated by GPUs. Search on Bibsonomy J. Chem. Inf. Model. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Thomas Romera, Andrea Petreto, Florian Lemaitre, Manuel Bouyer, Quentin L. Meunier, Lionel Lacassagne, Daniel Etiemble Optical flow algorithms optimized for speed, energy and accuracy on embedded GPUs. Search on Bibsonomy J. Real Time Image Process. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Jean-Sylvain Camier, Veselin Dobrev, Patrick Knupp, Tzanio V. Kolev, Ketan Mittal, Robert N. Rieben, Vladimir Z. Tomov Accelerating high-order mesh optimization using finite element partial assembly on GPUs. Search on Bibsonomy J. Comput. Phys. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Zichao Yang, Heng Wu, Yuanjia Xu, Yuewen Wu, Hua Zhong 0001, Wenbo Zhang 0006 Hydra: Deadline-Aware and Efficiency-Oriented Scheduling for Deep Learning Jobs on Heterogeneous GPUs. Search on Bibsonomy IEEE Trans. Computers The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Song Liu 0007, Zengyuan Zhang, Weiguo Wu DHTS: A Dynamic Hybrid Tiling Strategy for Optimizing Stencil Computation on GPUs. Search on Bibsonomy IEEE Trans. Computers The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Wenchao Wu, Xuanhua Shi, Ligang He, Hai Jin 0001 TurboGNN: Improving the End-to-End Performance for Sampling-Based GNN Training on GPUs. Search on Bibsonomy IEEE Trans. Computers The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Han Zhao 0005, Weihao Cui, Quan Chen 0002, Minyi Guo ISPA: Exploiting Intra-SM Parallelism in GPUs via Fine-Grained Resource Management. Search on Bibsonomy IEEE Trans. Computers The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Zhiwei Wang, Peinan Li, Rui Hou 0001, Zhihao Li, Jiangfeng Cao, XiaoFeng Wang 0001, Dan Meng HE-Booster: An Efficient Polynomial Arithmetic Acceleration on GPUs for Fully Homomorphic Encryption. Search on Bibsonomy IEEE Trans. Parallel Distributed Syst. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Shiyang Li, Ruiqi Tang, Jingyu Zhu, Ziyi Zhao, Xiaoli Gong, Wenwen Wang 0001, Jin Zhang 0003, Pen-Chung Yew Liberator: A Data Reuse Framework for Out-of-Memory Graph Computing on GPUs. Search on Bibsonomy IEEE Trans. Parallel Distributed Syst. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Hyeonjin Kim, William J. Song LAS: Locality-Aware Scheduling for GEMM-Accelerated Convolutions in GPUs. Search on Bibsonomy IEEE Trans. Parallel Distributed Syst. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Qiong Chang, Xiang Li, Yun Li, Jun Miyazaki Multi-directional Sobel operator kernel on GPUs. Search on Bibsonomy J. Parallel Distributed Comput. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Dominik Ernst, Markus Holzer 0005, Georg Hager, Matthias Knorr 0002, Gerhard Wellein Analytical performance estimation during code generation on modern GPUs. Search on Bibsonomy J. Parallel Distributed Comput. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Jan Lemeire, Jan G. Cornelis, Elias Konstantinidis Analysis of the analytical performance models for GPUs and extracting the underlying Pipeline model. Search on Bibsonomy J. Parallel Distributed Comput. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Diogo Nunes, Daniel Castro 0004, Paolo Romano 0002 CSMV: A highly scalable multi-versioned software transactional memory for GPUs. Search on Bibsonomy J. Parallel Distributed Comput. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Jou-An Chen, Hsin-Hsuan Sung, Xipeng Shen, Nathan R. Tallent, Kevin J. Barker, Ang Li 0006 Accelerating matrix-centric graph processing on GPUs through bit-level optimizations. Search on Bibsonomy J. Parallel Distributed Comput. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Aditya Kashi, Pratik Nayak, Dhruva Kulkarni, Aaron Scheinberg, Paul Lin, Hartwig Anzt Integrating batched sparse iterative solvers for the collision operator in fusion plasma simulations on GPUs. Search on Bibsonomy J. Parallel Distributed Comput. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Alessio Masola, Nicola Capodieci Optimization strategies for GPUs: an overview of architectural approaches. Search on Bibsonomy Int. J. Parallel Emergent Distributed Syst. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Alexander Krolik, Clark Verbrugge, Laurie J. Hendren rNdN: Fast Query Compilation for NVIDIA GPUs. Search on Bibsonomy ACM Trans. Archit. Code Optim. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Weizhi Xu 0001, Yintai Sun, Shengyu Fan, Hui Yu 0010, Xin Fu Accelerating Convolutional Neural Network by Exploiting Sparsity on GPUs. Search on Bibsonomy ACM Trans. Archit. Code Optim. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Jiangsu Du, Jiazhi Jiang, Jiang Zheng, Hongbin Zhang 0006, Dan Huang, Yutong Lu Improving Computation and Memory Efficiency for Real-world Transformer Inference on GPUs. Search on Bibsonomy ACM Trans. Archit. Code Optim. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Sooraj Puthoor, Mikko H. Lipasti Turn-based Spatiotemporal Coherence for GPUs. Search on Bibsonomy ACM Trans. Archit. Code Optim. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10N. S. Kapralov, A. Yu. Morozov, S. P. Nikulin Parallel Approximation of Multidimensional Tensors Using GPUs. Search on Bibsonomy Program. Comput. Softw. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Quim Aguado-Puig, Max Doblas, Christos Matzoros, Antonio Espinosa, Juan Carlos Moure, Santiago Marco-Sola, Miquel Moretó WFA-GPU: gap-affine pairwise read-alignment using GPUs. Search on Bibsonomy Bioinform. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Joël Lindegger, Damla Senol Cali, Mohammed Alser, Juan Gómez-Luna, Nika Mansouri-Ghiasi, Onur Mutlu Scrooge: a fast and memory-frugal genomic sequence aligner for CPUs, GPUs, and ASICs. Search on Bibsonomy Bioinform. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Lei Zou 0001, Fan Zhang 0050, Yinnian Lin, Yanpeng Yu An Efficient Data Structure for Dynamic Graph on GPUs. Search on Bibsonomy IEEE Trans. Knowl. Data Eng. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Aodong Chen, Fei Xu, Li Han, Yuan Dong, Li Chen, Zhi Zhou 0006, Fangming Liu Opara: Exploiting Operator Parallelism for Expediting DNN Inference on GPUs. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Huaizheng Zhang, Yuanming Li, Wencong Xiao, Yizheng Huang 0001, Xing Di, Jianxiong Yin, Simon See, Yong Luo 0002, Chiew Tong Lau, Yang You MIGPerf: A Comprehensive Benchmark for Deep Learning Training and Inference Workloads on Multi-Instance GPUs. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Jinyang Liu, Jiannan Tian, Shixun Wu, Sheng Di, Boyuan Zhang 0002, Yafan Huang, Kai Zhao 0008, Guanpeng Li, Dingwen Tao, Zizhong Chen, Franck Cappello cuSZ-I: High-Fidelity Error-Bounded Lossy Compression for Scientific Data on GPUs. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Shiyang Chen, Da Zheng, Caiwen Ding, Chengying Huan, Yuede Ji, Hang Liu Tango: rethinking quantization for graph neural network training on GPUs. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Jeongmin Brian Park, Zaid Qureshi, Vikram S. Mailthody, Andrew Gacek, Shunfan Shao, Mohammad Almasri, Isaac Gelado, Jinjun Xiong, Chris J. Newburn, I-Hsin Chung, Michael Garland, Nikolay Sakharnykh, Wen-Mei W. Hwu CODAG: Characterizing and Optimizing Decompression Algorithms for GPUs. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Qiange Wang, Yao Chen 0008, Weng-Fai Wong, Bingsheng He HongTu: Scalable Full-Graph GNN Training on Multiple GPUs (via communication-optimized CPU data offloading). Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Qiong Chang, Xin Li, Yun Li, Jun Miyazaki Multi-directional Sobel operator kernel on GPUs. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Rohith Krishnan S, Venkata Kalyan Tavva, Rupesh Nasre A Graph Data Structure to Optimize Dynamic Graph Processing on GPUs. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Feng Pan, Hanfeng Gu, Lvlin Kuang, Bing Liu, Pan Zhang Efficient Quantum Circuit Simulation by Tensor Network Methods on Modern GPUs. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Benjamin Brock, Aydin Buluç, Katherine A. Yelick RDMA-Based Algorithms for Sparse Matrix Multiplication on GPUs. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Shilei Tian, Tom Scogland, Barbara M. Chapman, Johannes Doerfert GPU First - Execution of Legacy CPU Codes on GPUs. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Stefan Zellmann, Serkan Demirci, Ugur Güdükbay Visual Analysis of Large Multi-Field AMR Data on GPUs Using Interactive Volume Lines. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Boyuan Zhang 0002, Jiannan Tian, Sheng Di, Xiaodong Yu 0001, Martin Swany, Dingwen Tao, Franck Cappello GPULZ: Optimizing LZSS Lossless Compression for Multi-byte Data on Modern GPUs. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Shixun Wu, Yujia Zhai, Jinyang Liu, Jiajun Huang, Zizhe Jian, Bryan M. Wong, Zizhong Chen Anatomy of High-Performance GEMM with Online Fault Tolerance on GPUs. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Yunuo Cen, Zhiwei Zhang, Xuanyao Fong Massively Parallel Continuous Local Search for Hybrid SAT Solving on GPUs. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
10Chunyang Wang, Desen Sun, Yuebin Bai PiPAD: Pipelined and Parallel Dynamic GNN Training on GPUs. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
Displaying results #301 - #400 of 4224 (100 per page).
Maintained by L3S.
Previously maintained by Jörg Diederich.
Based upon DBLP by Michael Ley.
Open data: released under the ODC-BY 1.0 license.