Results
Found 22 publication records. Showing 22 according to the selection in the facets
Hits ?▲ |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
149 | Bo Kågström, Per Ling, Charles Van Loan |
GEMM-based level 3 BLAS: high-performance model implementations and performance evaluation benchmark. |
ACM Trans. Math. Softw. |
1998 |
DBLP DOI BibTeX RDF |
GEMM-based level 3 BLAS, matrix-matrix kernels, parallelization, memory hierarchy, vectorization, FORTRAN 77, blocked algorithms |
87 | Isak Jonsson, Bo Kågström |
Parallel Triangular Sylvester-Type Matrix Equation Solvers for SMP Systems Using Recursive Blocking. |
PARA |
2000 |
DBLP DOI BibTeX RDF |
Sylvester-type matrix equations, recursion, superscalar, level 3 BLAS, GEMM-based, automatic blocking |
71 | Bo Kågström, Charles Van Loan |
Algorithm 784: GEMM-based level 3 BLAS: portability and optimization issues. |
ACM Trans. Math. Softw. |
1998 |
DBLP DOI BibTeX RDF |
GEMM-based level 3 BLAS, matrix-matrix kernels, parallelization, memory hierarchy, vectorization, FORTRAN 77, blocked algorithms |
48 | Robert Granat, Bo Kågström |
Evaluating Parallel Algorithms for Solving Sylvester-Type Matrix Equations: Direct Transformation-Based Versus Iterative Matrix-Sign-Function-Based Methods. |
PARA |
2004 |
DBLP DOI BibTeX RDF |
Sylvester matrix equation, Bartels–Stewart method, explicit blocking, c-stable matrices, PSLICOT, level 3 BLAS, continuous-time, GEMM-based, ScaLAPACK, Newton iteration, matrix sign function |
48 | Robert Granat, Isak Jonsson, Bo Kågström |
Combining Explicit, Recursive Blocking for Solving Triangular Sylvester-Type Matrix Equations on Distributed Memory Platforms. |
Euro-Par |
2004 |
DBLP DOI BibTeX RDF |
Sylvester matrix equation, Bartels–Stewart method, ScaLAPACK-style algorithms, RECSY, blocking, LAPACK, recursive algorithms, level 3 BLAS, continuous-time, GEMM-based, automatic blocking |
48 | Isak Jonsson, Bo Kågström |
RECSY - A High Performance Library for Sylvester-Type Matrix Equations. |
Euro-Par |
2003 |
DBLP DOI BibTeX RDF |
Sylvester-type matrix equations, RECSY, recursion, superscalar, LAPACK, level 3 BLAS, GEMM-based, SLICOT, automatic blocking |
48 | Robert Granat, Bo Kågström, Peter Poromaa |
Parallel ScaLAPACK-Style Algorithms for Solving Continuous-Time Sylvester Matrix Equations. |
Euro-Par |
2003 |
DBLP DOI BibTeX RDF |
Sylvester matrix equation, Bartels-Stewart method, ScaLAPACK-style algorithms, blocking, level 3 BLAS, continuous-time, GEMM-based, SLICOT |
47 | Bo Kågström |
Management of Deep Memory Hierarchies - Recursive Blocked Algorithms and Hybrid Data Structures for Dense Matrix Computations. |
PARA |
2004 |
DBLP DOI BibTeX RDF |
automatic variable blocking, hybrid data structures, superscalar kernels, SMP parallelization, library software, ESSL, RECSY, periodic systems, factorizations, recursion, superscalar, LAPACK, level 3 BLAS, dense linear algebra, GEMM-based, SLICOT, matrix equations |
28 | Isak Jonsson, Bo Kågström |
Recursive blocked algorithms for solving triangular systems - Part I: one-sided and coupled Sylvester-type matrix equations. |
ACM Trans. Math. Softw. |
2002 |
DBLP DOI BibTeX RDF |
SMP parallelization, generalized coupled Sylvester, standard Sylvester and Lyapunov, recursion, superscalar, LAPACK, level-3 BLAS, GEMM-based, SLICOT, Matrix equations, automatic blocking |
28 | Isak Jonsson, Bo Kågström |
Recursive blocked algorithms for solving triangular systems - Part II: two-sided and generalized Sylvester and Lyapunov matrix equations. |
ACM Trans. Math. Softw. |
2002 |
DBLP DOI BibTeX RDF |
SMP parallelization, generalized Sylvester and Lyapunov, standard discrete-time Sylvester and Lyapunov, recursion, superscalar, LAPACK, level-3 BLAS, GEMM-based, SLICOT, Matrix equations, automatic blocking |
23 | Jordi Fornt, Pau Fontova-Musté, Martí Caro, Jaume Abella 0001, Francesc Moll, Josep Altet, Christoph Studer |
An Energy-Efficient GeMM-Based Convolution Accelerator With On-the-Fly im2col. |
IEEE Trans. Very Large Scale Integr. Syst. |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Iryna De Albuquerque Silva, Thomas Carle, Adrien Gauffriau, Claire Pagetti |
Extending a predictable machine learning framework with efficient gemm-based convolution routines. |
Real Time Syst. |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Daniel Y. Fu, Simran Arora, Jessica Grogan, Isys Johnson, Sabri Eyuboglu, Armin W. Thomas, Benjamin Spector, Michael Poli, Atri Rudra, Christopher Ré |
Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Guosheng Yu, Zhihong Lv, Haijiang Wang, Zilong Huang, Jicheng Chen |
Task-aware Scheduling and Performance Optimization on Yitian710 SoC for GEMM-based Workloads on the Cloud. |
AICAS |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Daniel Y. Fu, Simran Arora, Jessica Grogan, Isys Johnson, Evan Sabri Eyuboglu, Armin W. Thomas, Benjamin Spector, Michael Poli, Atri Rudra, Christopher Ré |
Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture. |
NeurIPS |
2023 |
DBLP BibTeX RDF |
|
23 | Sergio Barrachina 0001, Manuel F. Dolz, Pablo San Juan, Enrique S. Quintana-Ortí |
Efficient and portable GEMM-based convolution operators for deep neural network training on multicore processors. |
J. Parallel Distributed Comput. |
2022 |
DBLP DOI BibTeX RDF |
|
23 | Chunhua Xiao, Chen Shi, Dandan Xu, Fangzhu Lin, Kun Ning |
SDST-Accelerating GEMM-based Convolution through Smart Data Stream Transformation. |
CCGRID |
2022 |
DBLP DOI BibTeX RDF |
|
23 | Andrew Anderson 0001, Aravind Vasudevan, Cormac Keane, David Gregg |
High-Performance Low-Memory Lowering: GEMM-based Algorithms for DNN Convolution. |
SBAC-PAD |
2020 |
DBLP DOI BibTeX RDF |
|
23 | Amarin Phaosawasdi, Christopher Rodrigues, Long Chen, Peng Wu 0001 |
CubeGen: Code Generation for Accelerated GEMM-Based Convolution with Tiling. |
LCPC |
2019 |
DBLP DOI BibTeX RDF |
|
23 | Andrew Anderson 0001, Aravind Vasudevan, Cormac Keane, David Gregg |
Low-memory GEMM-based convolution algorithms for deep neural networks. |
CoRR |
2017 |
DBLP BibTeX RDF |
|
23 | Fred G. Gustavson, André Henriksson, Isak Jonsson, Bo Kågström, Per Ling |
Superscalar GEMM-based Level 3 BLAS - The On-going Evolution of a Portable and High-Performance Library. |
PARA |
1998 |
DBLP DOI BibTeX RDF |
|
23 | Bo Kågström, Per Ling, Charles Van Loan |
Portable High Performance GEMM-Based Level 3 BLAS. |
PPSC |
1993 |
DBLP BibTeX RDF |
|
Displaying result #1 - #22 of 22 (100 per page; Change: )
|