The FacetedDBLP logo    Search for: in:

Disable automatic phrases ?     Syntactic query expansion: ?

Searching for GEMM with no syntactic query expansion in all metadata.

Publication years (Num. hits)
1993-2003 (16) 2004-2012 (15) 2013-2018 (17) 2019-2020 (23) 2021-2022 (26) 2023 (28) 2024 (8)
Publication types (Num. hits)
article(59) data(2) inproceedings(72)
GrowBag graphs for keyword ? (Num. hits/coverage)

Group by:
The graphs summarize 96 occurrences of 36 keywords

Results
Found 134 publication records. Showing 133 according to the selection in the facets
Hits ? Authors Title Venue Year Link Author keywords
145Bo Kågström, Per Ling, Charles Van Loan GEMM-based level 3 BLAS: high-performance model implementations and performance evaluation benchmark. Search on Bibsonomy ACM Trans. Math. Softw. The full citation details ... 1998 DBLP  DOI  BibTeX  RDF GEMM-based level 3 BLAS, matrix-matrix kernels, parallelization, memory hierarchy, vectorization, FORTRAN 77, blocked algorithms
101Yinan Li 0002, Jack J. Dongarra, Stanimire Tomov A Note on Auto-tuning GEMM for GPUs. Search on Bibsonomy ICCS (1) The full citation details ... 2009 DBLP  DOI  BibTeX  RDF matrix multiply, GPUs, Auto-tuning, dense linear algebra
83Isak Jonsson, Bo Kågström Parallel Triangular Sylvester-Type Matrix Equation Solvers for SMP Systems Using Recursive Blocking. Search on Bibsonomy PARA The full citation details ... 2000 DBLP  DOI  BibTeX  RDF Sylvester-type matrix equations, recursion, superscalar, level 3 BLAS, GEMM-based, automatic blocking
78Michel J. Daydé, Iain S. Duff, Antoine Petitet A parallel block implementation of Level-3 BLAS for MIMD vector processors. Search on Bibsonomy ACM Trans. Math. Softw. The full citation details ... 1994 DBLP  DOI  BibTeX  RDF matrix-matrix kernels, parallelization, vectorization, Level-3 BLAS
66Bo Kågström, Charles Van Loan Algorithm 784: GEMM-based level 3 BLAS: portability and optimization issues. Search on Bibsonomy ACM Trans. Math. Softw. The full citation details ... 1998 DBLP  DOI  BibTeX  RDF GEMM-based level 3 BLAS, matrix-matrix kernels, parallelization, memory hierarchy, vectorization, FORTRAN 77, blocked algorithms
63Michael J. Feeley, Norman C. Hutchinson, Suprio Ray Realistic Mobility for Mobile Ad Hoc Network Simulation. Search on Bibsonomy ADHOC-NOW The full citation details ... 2004 DBLP  DOI  BibTeX  RDF GEMM, MANET, Mobility Model
59Ahmed Sherif Zekri, Stanislav G. Sedukhin The general matrix multiply-add operation on 2D torus. Search on Bibsonomy IPDPS The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
51John S. McCaskill, Thomas Maeke, Udo Gemm, Ludger Schulte, Uwe Tangen NGEN: A Massively Parallel Reconfigurable Computer for Biological Simulation: Towards a Self-Organizing Computer. Search on Bibsonomy ICES The full citation details ... 1996 DBLP  DOI  BibTeX  RDF
45Shixun Wu, Yujia Zhai, Jiajun Huang, Zizhe Jian, Zizhong Chen FT-GEMM: A Fault Tolerant High Performance GEMM Implementation on x86 CPUs. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
45Shixun Wu, Yujia Zhai, Jiajun Huang, Zizhe Jian, Zizhong Chen FT-GEMM: A Fault Tolerant High Performance GEMM Implementation on x86 CPUs. Search on Bibsonomy HPDC The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
44Robert Granat, Bo Kågström Evaluating Parallel Algorithms for Solving Sylvester-Type Matrix Equations: Direct Transformation-Based Versus Iterative Matrix-Sign-Function-Based Methods. Search on Bibsonomy PARA The full citation details ... 2004 DBLP  DOI  BibTeX  RDF Sylvester matrix equation, Bartels–Stewart method, explicit blocking, c-stable matrices, PSLICOT, level 3 BLAS, continuous-time, GEMM-based, ScaLAPACK, Newton iteration, matrix sign function
44Bo Kågström Management of Deep Memory Hierarchies - Recursive Blocked Algorithms and Hybrid Data Structures for Dense Matrix Computations. Search on Bibsonomy PARA The full citation details ... 2004 DBLP  DOI  BibTeX  RDF automatic variable blocking, hybrid data structures, superscalar kernels, SMP parallelization, library software, ESSL, RECSY, periodic systems, factorizations, recursion, superscalar, LAPACK, level 3 BLAS, dense linear algebra, GEMM-based, SLICOT, matrix equations
44Robert Granat, Isak Jonsson, Bo Kågström Combining Explicit, Recursive Blocking for Solving Triangular Sylvester-Type Matrix Equations on Distributed Memory Platforms. Search on Bibsonomy Euro-Par The full citation details ... 2004 DBLP  DOI  BibTeX  RDF Sylvester matrix equation, Bartels–Stewart method, ScaLAPACK-style algorithms, RECSY, blocking, LAPACK, recursive algorithms, level 3 BLAS, continuous-time, GEMM-based, automatic blocking
44Isak Jonsson, Bo Kågström RECSY - A High Performance Library for Sylvester-Type Matrix Equations. Search on Bibsonomy Euro-Par The full citation details ... 2003 DBLP  DOI  BibTeX  RDF Sylvester-type matrix equations, RECSY, recursion, superscalar, LAPACK, level 3 BLAS, GEMM-based, SLICOT, automatic blocking
44Robert Granat, Bo Kågström, Peter Poromaa Parallel ScaLAPACK-Style Algorithms for Solving Continuous-Time Sylvester Matrix Equations. Search on Bibsonomy Euro-Par The full citation details ... 2003 DBLP  DOI  BibTeX  RDF Sylvester matrix equation, Bartels-Stewart method, ScaLAPACK-style algorithms, blocking, level 3 BLAS, continuous-time, GEMM-based, SLICOT
44Isak Jonsson, Bo Kågström Recursive blocked algorithms for solving triangular systems - Part I: one-sided and coupled Sylvester-type matrix equations. Search on Bibsonomy ACM Trans. Math. Softw. The full citation details ... 2002 DBLP  DOI  BibTeX  RDF SMP parallelization, generalized coupled Sylvester, standard Sylvester and Lyapunov, recursion, superscalar, LAPACK, level-3 BLAS, GEMM-based, SLICOT, Matrix equations, automatic blocking
44Isak Jonsson, Bo Kågström Recursive blocked algorithms for solving triangular systems - Part II: two-sided and generalized Sylvester and Lyapunov matrix equations. Search on Bibsonomy ACM Trans. Math. Softw. The full citation details ... 2002 DBLP  DOI  BibTeX  RDF SMP parallelization, generalized Sylvester and Lyapunov, standard discrete-time Sylvester and Lyapunov, recursion, superscalar, LAPACK, level-3 BLAS, GEMM-based, SLICOT, Matrix equations, automatic blocking
39Vasily Volkov, James Demmel Benchmarking GPUs to tune dense linear algebra. Search on Bibsonomy SC The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
39Bjarne Stig Andersen, Jerzy Wasniewski, Fred G. Gustavson A recursive formulation of Cholesky factorization of a matrix in packed storage. Search on Bibsonomy ACM Trans. Math. Softw. The full citation details ... 2001 DBLP  DOI  BibTeX  RDF Cholesky factorization and solution, complex Hermitian matrices, novel packed matrix data structures, real symmetric matrices, BLAS, recursive algorithms, positive definite matrices
39Fred G. Gustavson, Isak Jonsson High Performance Cholesky Factorization via Blocking and Recursion That Uses Minimal Storage. Search on Bibsonomy PARA The full citation details ... 2000 DBLP  DOI  BibTeX  RDF packed format, level 3 BLAS parallelism, recursive algorithm, Cholesky factorization, recursive data structure
39Michel J. Daydé, Iain S. Duff The RISC BLAS: a blocked implementation of level 3 BLAS for RISC processors. Search on Bibsonomy ACM Trans. Math. Softw. The full citation details ... 1999 DBLP  DOI  BibTeX  RDF matrix-matrix kernels, blocking, loop-unrolling, level 3 BLAS, RISC processors
24Samuel Williams 0001, John Shalf, Leonid Oliker, Shoaib Kamil 0001, Parry Husbands, Katherine A. Yelick Scientific Computing Kernels on the Cell Processor. Search on Bibsonomy Int. J. Parallel Program. The full citation details ... 2007 DBLP  DOI  BibTeX  RDF GEMM, SpMV, three level memory, FFT, sparse matrix, Cell processor, Stencil
24Samuel Williams 0001, John Shalf, Leonid Oliker, Shoaib Kamil 0001, Parry Husbands, Katherine A. Yelick The potential of the cell processor for scientific computing. Search on Bibsonomy Conf. Computing Frontiers The full citation details ... 2006 DBLP  DOI  BibTeX  RDF GEMM, SpMV, three level memory, FFT, sparse matrix, cell processor, stencil
23Susana Ortega-Cisneros Design and Implementation of an NoC-Based Convolution Architecture With GEMM and Systolic Arrays. Search on Bibsonomy IEEE Embed. Syst. Lett. The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
23Cong Guo 0003, Fengchen Xue, Jingwen Leng, Yuxian Qiu, Yue Guan, Weihao Cui, Quan Chen 0002, Minyi Guo Accelerating Sparse DNNs Based on Tiled GEMM. Search on Bibsonomy IEEE Trans. Computers The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
23Bo Wang, Sheng Ma, Shengbai Luo, Lizhou Wu, Jianmin Zhang, Chunyuan Zhang, Tiejun Li SparGD: A Sparse GEMM Accelerator with Dynamic Dataflow. Search on Bibsonomy ACM Trans. Design Autom. Electr. Syst. The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
23Venkata Sai Praneeth Karempudi, Sairam Sri Vatsavai, Ishan G. Thakkar, Oluwaseun Adewunmi Alo, Jeffrey Todd Hastings, Justin Scott Woods A Low-Dissipation and Scalable GEMM Accelerator with Silicon Nitride Photonics. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
23Sairam Sri Vatsavai, Venkata Sai Praneeth Karempudi, Oluwaseun Adewunmi Alo, Ishan G. Thakkar A Comparative Analysis of Microrings Based Incoherent Photonic GEMM Accelerators. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
23Cong Guo 0003, Fengchen Xue, Jingwen Leng, Yuxian Qiu, Yue Guan, Weihao Cui, Quan Chen 0002, Minyi Guo Accelerating Sparse DNNs Based on Tiled GEMM. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
23Jaeyong Jang, Yulhwa Kim, Juheun Lee, Jae-Joon Kim FIGNA: Integer Unit-Based Accelerator Design for FP-INT GEMM Preserving Numerical Accuracy. Search on Bibsonomy HPCA The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
23Seonghun Jeong, Jooyeon Lee, Jaeha Kung A Full SW-HW Demonstration of GEMM Accelerators with RISC-V Instruction Extensions. Search on Bibsonomy ICEIC The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
23Lili Xu, Binjie Chen, Chenhao Huang, Mengmeng Zhou, Shucheng You, Fangming Jiang, Weirong Chen, Jinsong Deng Identifying PM2.5-Related Health Burden in the Context of the Integrated Development of Urban Agglomeration Using Remote Sensing and GEMM Model. Search on Bibsonomy Remote. Sens. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Sandeep Kumar Sharma, Amit Chaurasia, Vijay Shankar Sharma, Chiranji Lal Chowdhary, Shakila Basheer GEMM, a Genetic Engineering-Based Mutual Model for Resource Allocation of Grid Computing. Search on Bibsonomy IEEE Access The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Jordi Fornt, Pau Fontova-Musté, Martí Caro, Jaume Abella 0001, Francesc Moll, Josep Altet, Christoph Studer An Energy-Efficient GeMM-Based Convolution Accelerator With On-the-Fly im2col. Search on Bibsonomy IEEE Trans. Very Large Scale Integr. Syst. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Iryna De Albuquerque Silva, Thomas Carle, Adrien Gauffriau, Claire Pagetti Extending a predictable machine learning framework with efficient gemm-based convolution routines. Search on Bibsonomy Real Time Syst. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Hyeonjin Kim, William J. Song LAS: Locality-Aware Scheduling for GEMM-Accelerated Convolutions in GPUs. Search on Bibsonomy IEEE Trans. Parallel Distributed Syst. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Louis Ledoux, Marc Casas Open-Source GEMM Hardware Kernels Generator: Toward Numerically-Tailored Computations. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Shixun Wu, Yujia Zhai, Jinyang Liu, Jiajun Huang, Zizhe Jian, Bryan M. Wong, Zizhong Chen Anatomy of High-Performance GEMM with Online Fault Tolerance on GPUs. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Geonhwa Jeong, Sana Damani, Abhimanyu Rajeshkumar Bambhaniya, Eric Qin 0001, Christopher J. Hughes, Sreenivas Subramoney, Hyesoon Kim, Tushar Krishna VEGETA: Vertically-Integrated Extensions for Sparse/Dense GEMM Tile Acceleration on CPUs. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Saeed Maleki Look-Up mAI GeMM: Increasing AI GeMMs Performance by Nearly 2.5x via msGeMM. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Bo Fang, Xinyi Li, Harvey Dam, Cheng Tan 0002, Siva Kumar Sastry Hari, Timothy Tsai 0002, Ignacio Laguna, Dingwen Tao, Ganesh Gopalakrishnan, Prashant J. Nair, Kevin J. Barker, Ang Li 0006 MPGemmFI: A Fault Injection Technique for Mixed Precision GEMM in ML Applications. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Ruqing G. Xu, Field G. Van Zee, Robert A. van de Geijn GEMMFIP: Unifying GEMM in BLIS. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Daniel Y. Fu, Simran Arora, Jessica Grogan, Isys Johnson, Sabri Eyuboglu, Armin W. Thomas, Benjamin Spector, Michael Poli, Atri Rudra, Christopher Ré Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Devangi N. Parikh, Robert A. van de Geijn, Greg M. Henry Cascading GEMM: High Precision from Low Precision. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Enrico Reggiani, Alessandro Pappalardo, Max Doblas, Miquel Moretó, Mauro Olivieri, Osman Sabri Unsal, Adrián Cristal Mix-GEMM: An efficient HW-SW Architecture for Mixed-Precision Quantized Deep Neural Networks Inference on Edge Devices. Search on Bibsonomy HPCA The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Ranggi Hwang, Minhoo Kang, Jiwon Lee, Dongyun Kam, Youngjoo Lee, Minsoo Rhu GROW: A Row-Stationary Sparse-Dense GEMM Accelerator for Memory-Efficient Graph Convolutional Neural Networks. Search on Bibsonomy HPCA The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Geonhwa Jeong, Sana Damani, Abhimanyu Rajeshkumar Bambhaniya, Eric Qin 0001, Christopher J. Hughes, Sreenivas Subramoney, Hyesoon Kim, Tushar Krishna VEGETA: Vertically-Integrated Extensions for Sparse/Dense GEMM Tile Acceleration on CPUs. Search on Bibsonomy HPCA The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Susmita Dey Manasi, Suvadeep Banerjee, Abhijit Davare, Anton A. Sorokin, Steven M. Burns, Desmond A. Kirkpatrick, Sachin S. Sapatnekar Reusing GEMM Hardware for Efficient Execution of Depthwise Separable Convolution on ASIC-Based DNN Accelerators. Search on Bibsonomy ASP-DAC The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Jie Lei, Héctor Martínez, José Flich, Enrique S. Quintana-Ortí GEMM-Like Convolution for Deep Learning Inference on the Xilinx Versal. Search on Bibsonomy ISC Workshops The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Guosheng Yu, Zhihong Lv, Haijiang Wang, Zilong Huang, Jicheng Chen Task-aware Scheduling and Performance Optimization on Yitian710 SoC for GEMM-based Workloads on the Cloud. Search on Bibsonomy AICAS The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23RuQing G. Xu, Field G. Van Zee, Robert A. van de Geijn Towards a Unified Implementation of GEMM in BLIS. Search on Bibsonomy ICS The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Shixun Wu, Yujia Zhai, Jinyang Liu, Jiajun Huang, Zizhe Jian, Bryan M. Wong, Zizhong Chen Anatomy of High-Performance GEMM with Online Fault Tolerance on GPUs. Search on Bibsonomy ICS The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Alexey Romanov, Andrei Turkin, Oleg Myakinin, Fiodar Tsupko, Jiexing Gao Parameter Estimation via Time Modeling for MLIR Implementation of GEMM. Search on Bibsonomy OPTIMA The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Yongseung Yu, Donghyun Son, Younghyun Lee, Sunghyun Park 0004, Giha Ryu, Myeongjin Cho, Jiwon Seo 0002, Yongjun Park 0001 Tailoring CUTLASS GEMM using Supervised Learning. Search on Bibsonomy ICCD The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Harideep Nair, Prabhu Vellaisamy, Albert Chen, Joseph Finn, Anna Li, Manav Trivedi, John Paul Shen tuGEMM: Area-Power-Efficient Temporal Unary GEMM Architecture for Low-Precision Edge AI. Search on Bibsonomy ISCAS The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Daniel Y. Fu, Simran Arora, Jessica Grogan, Isys Johnson, Evan Sabri Eyuboglu, Armin W. Thomas, Benjamin Spector, Michael Poli, Atri Rudra, Christopher Ré Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture. Search on Bibsonomy NeurIPS The full citation details ... 2023 DBLP  BibTeX  RDF
23Tahsin Tariq Banna, Swakshar Deb, Sejuti Rahman, Shafin Rahman GEMM: A Graph Embedded Model for Memorability Prediction. Search on Bibsonomy IJCNN The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Zhiwei Yang, Lu Lu, Ruimin Wang A batched GEMM optimization framework for deep learning. Search on Bibsonomy J. Supercomput. The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
23Thomas Faingnaert, Tim Besard, Bjorn De Sutter Flexible Performant GEMM Kernels on GPUs. Search on Bibsonomy IEEE Trans. Parallel Distributed Syst. The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
23Sergio Barrachina 0001, Manuel F. Dolz, Pablo San Juan, Enrique S. Quintana-Ortí Efficient and portable GEMM-based convolution operators for deep neural network training on multicore processors. Search on Bibsonomy J. Parallel Distributed Comput. The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
23Nihat Mert Cicek, Xipeng Shen, Ozcan Ozturk 0001 Energy Efficient Boosting of GEMM Accelerators for DNN via Reuse. Search on Bibsonomy ACM Trans. Design Autom. Electr. Syst. The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
23Yunan Zhang, Po-An Tsai, Hung-Wei Tseng 0001 SIMD2: A Generalized Matrix Instruction Set for Accelerating Tensor Computation beyond GEMM. Search on Bibsonomy CoRR The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
23Jianyu Yao, Boqian Shi, Chunyang Xiang, Haipeng Jia, Chendi Li, Hang Cao, Yunquan Zhang IAAT: A Input-Aware Adaptive Tuning framework for Small GEMM. Search on Bibsonomy CoRR The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
23Minhoo Kang, Ranggi Hwang, Jiwon Lee, Dongyun Kam, Youngjoo Lee, Minsoo Rhu GROW: A Row-Stationary Sparse-Dense GEMM Accelerator for Memory-Efficient Graph Convolutional Neural Networks. Search on Bibsonomy CoRR The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
23Mark Gates, Asim YarKhan, Dalal Sukkari, Kadir Akbudak, Sébastien Cayrols, Daniel Bielich, Mohammed A. Al Farhan, Jack J. Dongarra Reproducability Artifact for Running SLATE's GEMM and POTRF Operations on Summit and Crusher. Search on Bibsonomy 2022   DOI  RDF
23Mark Gates, Asim YarKhan, Dalal Sukkari, Kadir Akbudak, Sébastien Cayrols, Daniel Bielich, Ahmad Abdelfattah, Mohammed A. Al Farhan, Jack J. Dongarra Reproducability Artifact for Running SLATE's GEMM and POTRF Operations on Summit and Crusher. Search on Bibsonomy 2022   DOI  RDF
23Bo Wang, Sheng Ma, Zhong Liu, Libo Huang, Yuan Yuan 0034, Yi Dai SADD: A Novel Systolic Array Accelerator with Dynamic Dataflow for Sparse GEMM in Deep Learning. Search on Bibsonomy NPC The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
23Cunyang Wei, Haipeng Jia, Yunquan Zhang, Kun Li, Luhan Wang LBBGEMM: A Load-balanced Batch GEMM Framework on ARM CPU s. Search on Bibsonomy HPCC/DSS/SmartCity/DependSys The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
23Arthur Francisco Lorenzon, Sandro Matheus V. N. Marques, Antoni C. Navarro, Vicenç Beltran 0001 Seamless optimization of the GEMM kernel for task-based programming models. Search on Bibsonomy ICS The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
23Chunhua Xiao, Chen Shi, Dandan Xu, Fangzhu Lin, Kun Ning SDST-Accelerating GEMM-based Convolution through Smart Data Stream Transformation. Search on Bibsonomy CCGRID The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
23Bo Wang, Sheng Ma, Yuan Yuan 0034, Yi Dai, Wei Jiang, Xiang Hou, Xiao Yi, Rui Xu SparG: A Sparse GEMM Accelerator for Deep Learning Applications. Search on Bibsonomy ICA3PP The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
23Dennis Agyemanh Nana Gookyi, Eunchong Lee, Kyungho Kim, Sung-Joon Jang, Sang-Seol Lee Exploring GEMM Operations on Different Configurations of the Gemmini Accelerator. Search on Bibsonomy ISOCC The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
23Bingyi Zhang, Akhilesh R. Jaiswal, Clynn Mathew, Ravi Teja Lakkireddy, Ajey P. Jacob, Sasindu Wijeratne, Viktor K. Prasanna Modeling the Energy Efficiency of GEMM using Optical Random Access Memory. Search on Bibsonomy HPEC The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
23Yunan Zhang, Po-An Tsai, Hung-Wei Tseng 0001 SIMD2: a generalized matrix instruction set for accelerating tensor computation beyond GEMM. Search on Bibsonomy ISCA The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
23Ananda Samajdar, Eric Qin 0001, Michael Pellauer, Tushar Krishna Self adaptive reconfigurable arrays (SARA): learning flexible GEMM accelerator configuration and mapping-space using ML. Search on Bibsonomy DAC The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
23Di Wu 0016, Jingjie Li, Ruokai Yin, Hsuan Hsiao, Younghyun Kim 0001, Joshua San Miguel uGEMM: Unary Computing for GEMM Applications. Search on Bibsonomy IEEE Micro The full citation details ... 2021 DBLP  DOI  BibTeX  RDF
23Qingchang Han, Hailong Yang, Ming Dun, Zhongzhi Luan, Lin Gan, Guangwen Yang, Depei Qian Towards efficient tile low-rank GEMM computation on sunway many-core processors. Search on Bibsonomy J. Supercomput. The full citation details ... 2021 DBLP  DOI  BibTeX  RDF
23Mochamad Asri, Dhairya Malhotra, Jiajun Wang, George Biros, Lizy K. John, Andreas Gerstlauer Hardware Accelerator Integration Tradeoffs for High-Performance Computing: A Case Study of GEMM Acceleration in N-Body Methods. Search on Bibsonomy IEEE Trans. Parallel Distributed Syst. The full citation details ... 2021 DBLP  DOI  BibTeX  RDF
23Ananda Samajdar, Michael Pellauer, Tushar Krishna Self-Adaptive Reconfigurable Arrays (SARA): Using ML to Assist Scaling GEMM Acceleration. Search on Bibsonomy CoRR The full citation details ... 2021 DBLP  BibTeX  RDF
23Ratko Pilipovic, Vladimir Risojevic, Janko Bozic, Patricio Bulic, Uros Lotric An Approximate GEMM Unit for Energy-Efficient Object Detection. Search on Bibsonomy Sensors The full citation details ... 2021 DBLP  DOI  BibTeX  RDF
23Reza Hojabr, Ali Sedaghati, Amirali Sharifian, Ahmad Khonsari, Arrvindh Shriraman SPAGHETTI: Streaming Accelerators for Highly Sparse GEMM on FPGAs. Search on Bibsonomy HPCA The full citation details ... 2021 DBLP  DOI  BibTeX  RDF
23Jianyu Yao, Boqian Shi, Chunyang Xiang, Haipeng Jia, Chendi Li, Hang Cao, Yunquan Zhang IAAT: A Input-Aware Adaptive Tuning framework for Small GEMM. Search on Bibsonomy ICPADS The full citation details ... 2021 DBLP  DOI  BibTeX  RDF
23Malith Jayaweera, Kaustubh Shivdikar, Yanzhi Wang, David R. Kaeli JAXED: Reverse Engineering DNN Architectures Leveraging JIT GEMM Libraries. Search on Bibsonomy SEED The full citation details ... 2021 DBLP  DOI  BibTeX  RDF
23Zhi Gang Liu, Paul N. Whatmough, Matthew Mattina Systolic Tensor Array: An Efficient Structured-Sparse GEMM Accelerator for Mobile CNN Inference. Search on Bibsonomy IEEE Comput. Archit. Lett. The full citation details ... 2020 DBLP  DOI  BibTeX  RDF
23Uday Bondhugula High Performance Code Generation in MLIR: An Early Case Study with GEMM. Search on Bibsonomy CoRR The full citation details ... 2020 DBLP  BibTeX  RDF
23Zhi Gang Liu, Paul N. Whatmough, Matthew Mattina Systolic Tensor Array: An Efficient Structured-Sparse GEMM Accelerator for Mobile CNN Inference. Search on Bibsonomy CoRR The full citation details ... 2020 DBLP  BibTeX  RDF
23Thomas Faingnaert, Tim Besard, Bjorn De Sutter Flexible Performant GEMM Kernels on GPUs. Search on Bibsonomy CoRR The full citation details ... 2020 DBLP  BibTeX  RDF
23Natalie Beams, Ahmad Abdelfattah, Stan Tomov, Jack J. Dongarra, Tzanio V. Kolev, Yohann Dudouit High-Order Finite Element Method using Standard and Device-Level Batch GEMM on GPUs. Search on Bibsonomy ScalA@SC The full citation details ... 2020 DBLP  DOI  BibTeX  RDF
23Eric Qin 0001, Ananda Samajdar, Hyoukjun Kwon, Vineet Nadella, Sudarshan Srinivasan, Dipankar Das 0002, Bharat Kaul, Tushar Krishna SIGMA: A Sparse and Irregular GEMM Accelerator with Flexible Interconnects for DNN Training. Search on Bibsonomy HPCA The full citation details ... 2020 DBLP  DOI  BibTeX  RDF
23Ioannis Oroutzoglou, Dimosthenis Masouros, Konstantina Koliogeorgi, Sotirios Xydis, Dimitrios Soudris Exploration of GPU sharing policies under GEMM workloads. Search on Bibsonomy SCOPES The full citation details ... 2020 DBLP  DOI  BibTeX  RDF
23Guoning Lu, Dong Xu 0015, Ning Wang, Xiao Zhang, Degen Zhen, Hong Lei, Yunlong Bai, Dehui Kong, Hang Ruan, Zhifeng Chi, Xiankui Xiong, Ke Xu 0014 A Design of 16TOPS Efficient GEMM Module in Deep Learning Accelerator. Search on Bibsonomy ICTA The full citation details ... 2020 DBLP  DOI  BibTeX  RDF
23Yunping Zhao, Jianzhuang Lu, Xiaowen Chen A Design of GEMM Parallel Computing Accelerator Based on Vector SIMD Technology. Search on Bibsonomy ICCTA The full citation details ... 2020 DBLP  DOI  BibTeX  RDF
23Philip Colangelo, Shayan Sengupta, Martin Margala Sparse Persistent GEMM Accelerator using OpenCL for Intel FPGAs. Search on Bibsonomy ISCAS The full citation details ... 2020 DBLP  DOI  BibTeX  RDF
23Andrew Anderson 0001, Aravind Vasudevan, Cormac Keane, David Gregg High-Performance Low-Memory Lowering: GEMM-based Algorithms for DNN Convolution. Search on Bibsonomy SBAC-PAD The full citation details ... 2020 DBLP  DOI  BibTeX  RDF
23Sheng Wei Pang, Chai Quek, Dilip K. Prasad GEMM-eMFIS (FRI/E): A Novel General Episodic Memory Mechanism For Fuzzy Neural Networks. Search on Bibsonomy IJCNN The full citation details ... 2020 DBLP  DOI  BibTeX  RDF
23Di Wu 0016, Jingjie Li, Ruokai Yin, Hsuan Hsiao, Younghyun Kim 0001, Joshua San Miguel UGEMM: Unary Computing Architecture for GEMM Applications. Search on Bibsonomy ISCA The full citation details ... 2020 DBLP  DOI  BibTeX  RDF
23S. Kala, Babita R. Jose, Jimson Mathew, Nalesh Sivanandan High-Performance CNN Accelerator on FPGA Using Unified Winograd-GEMM Architecture. Search on Bibsonomy IEEE Trans. Very Large Scale Integr. Syst. The full citation details ... 2019 DBLP  DOI  BibTeX  RDF
23Roktaek Lim, Yeongha Lee, Raehyun Kim, Jaeyoung Choi, Myungho Lee Auto-tuning GEMM kernels on the Intel KNL and Intel Skylake-SP processors. Search on Bibsonomy J. Supercomput. The full citation details ... 2019 DBLP  DOI  BibTeX  RDF
23Xing Su, Xiangke Liao, Hao Jiang 0001, Canqun Yang, Jingling Xue SCP: Shared Cache Partitioning for High-Performance GEMM. Search on Bibsonomy ACM Trans. Archit. Code Optim. The full citation details ... 2019 DBLP  DOI  BibTeX  RDF
23Wenlei Bao, Li-Wen Chang, Yang Chen, Ke Deng, Amit Agarwal, Emad Barsoum, Abe Taha NGEMM: Optimizing GEMM for Deep Learning via Compiler-based Techniques. Search on Bibsonomy CoRR The full citation details ... 2019 DBLP  BibTeX  RDF
Displaying result #1 - #100 of 133 (100 per page; Change: )
Pages: [1][2][>>]
Valid XHTML 1.1! Valid CSS! [Valid RSS]
Maintained by L3S.
Previously maintained by Jörg Diederich.
Based upon DBLP by Michael Ley.
open data data released under the ODC-BY 1.0 license