Results
Found 1755 publication records. Showing 1749 according to the selection in the facets.
Hits |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
13 | Xuan Jiang, Laurence Lu, Linyue Song |
Incompressible Fluid Simulation Parallelization with OpenMP, MPI and CUDA. |
FICC (2) |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Manuel Costanzo, Enzo Rucci, Carlos García Sánchez 0001, Marcelo R. Naiouf, Manuel Prieto-Matías |
Comparing Performance and Portability Between CUDA and SYCL for Protein Database Search on NVIDIA, AMD, and Intel GPUs. |
SBAC-PAD |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Mohamed A. Elgammal, Omar Mohamed Awad, Isak Edo Vivancos, Andreas Moshovos, Vaughn Betz |
cuSCNN : an Efficient CUDA Implementation of Sparse CNNs. |
HEART |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Matthew Horvath, Michael Bowers, Shadi Alawneh |
Canny Edge Detection on GPU using CUDA. |
CCWC |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Lesia Mochurad, Ostap-Vasyl Matviiv, Halyna Lema, Roksolana Vilhutska |
CUDA-Based Algorithm for Lidar Position Determination in Mobile Robotics. |
MoMLeT+DS |
2023 |
DBLP BibTeX RDF |
|
13 | Swati Jindal, Xin Eric Wang |
CUDA-GHR: Controllable Unsupervised Domain Adaptation for Gaze and Head Redirection. |
WACV |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Brian Chen, Nafis Mustakin, Alvin Hoang, Sakib Fuad, Daniel Wong 0001 |
VSCuda: LLM based CUDA extension for Visual Studio Code. |
SC Workshops |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Jun Chen, Xule Zhou, Hyesoon Kim |
CuPBoP-AMD: Extending CUDA to AMD Platforms. |
SC Workshops |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Niklas Eiling, Stefan Lankes, Antonello Monti |
Checkpoint/Restart for CUDA Kernels. |
SC Workshops |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Christoph Weckert, Leonardo Solis-Vasquez, Julian Oppermann, Andreas Koch 0001, Oliver Sinnen |
Altis-SYCL: Migrating Altis Benchmarking Suite from CUDA to SYCL for GPUs and FPGAs. |
SC Workshops |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Brenda S. Schussler, Pedro H. C. Rigon, Arthur Francisco Lorenzon, Alexandre Carissimi, Philippe O. A. Navaux |
The Impact of CUDA Execution Configuration Parameters on the Performance and Energy of a Seismic Application. |
CARLA |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Dong Chen, Tanping Zhou, Wenchao Liu 0002, Zichen Zhou, Yujie Ding, Xiaoyuan Yang |
Construction of a Fully Homomorphic Encryption Scheme with Shorter Ciphertext and Its Implementation on the CUDA Platform. |
EIDWT |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Jonathan D. Wapman, Sean Treichler, Serban D. Porumbescu, John D. Owens |
Harmonic CUDA: Asynchronous Programming on GPUs. |
PMAM@PPoPP |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Ruobing Han, Jun Chen, Bhanu Garg, Jeffrey Young 0001, Jaewoong Sim, Hyesoon Kim |
CuPBoP: A Framework to Make CUDA Portable. |
PPoPP |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Sumyeong Ahn, Jongwoo Ko, Se-Young Yun |
CUDA: Curriculum of Data Augmentation for Long-tailed Recognition. |
ICLR |
2023 |
DBLP BibTeX RDF |
|
13 | Jin-Sung Kim, Alex McCaskey, Bettina Heim, Manish Modani, Sam Stanwyck, Timothy B. Costa |
CUDA Quantum: The Platform for Integrated Quantum-Classical Computing. |
DAC |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Alexander Brandt, Marc Moreno Maza, Taabish Jeshani, Linxiao Wang, Davood Mohajerani, Jeeva Paudel |
Dynamically Finding Optimal Kernel Launch Parameters for CUDA Programs. |
CASCON |
2023 |
DBLP BibTeX RDF |
|
13 | Robin Kobus |
Accelerating bioinformatics applications on CUDA-enabled multi-GPU systems. |
|
2023 |
RDF |
|
13 | Soyoon Bak, Philsu Kim, Sangbeom Park 0001 |
Development of a parallel CUDA algorithm for solving 3D guiding center problems. |
Comput. Phys. Commun. |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Fabio Bonaccorso, Marco Lauricella 0001, Andrea Montessori, Giorgio Amati, Massimo Bernaschi, Filippo Spiga, Adriano Tiribocchi, Sauro Succi |
LBcuda: A high-performance CUDA port of LBsoft for simulation of colloidal systems. |
Comput. Phys. Commun. |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Alankar Shantaram Shelar, Raj Kulkarni |
Swarm of Honey Bees for Association Rule Mining Using CUDA. |
Int. J. Softw. Innov. |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Pasquale De Luca, Ardelio Galletti, Livia Marcellino |
GPU-CUDA Implementation of the Third Order Gaussian Recursive Filter. |
SN Comput. Sci. |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Lesia I. Mochurad |
Canny Edge Detection Analysis Based on Parallel Algorithm, Constructed Complexity Scale and CUDA. |
Comput. Informatics |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Ahmet E. Topcu, Isameddin Omak |
Parallelization of a meteorological model using message passing interface and CUDA: A case study with the inversion estimation algorithm. |
Concurr. Comput. Pract. Exp. |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Niklas Eiling, Jonas Baude, Stefan Lankes, Antonello Monti |
Cricket: A virtualization layer for distributed execution of CUDA applications with checkpoint/restart support. |
Concurr. Comput. Pract. Exp. |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Dylan Matthew Janssen, Wayne Pullan, Alan Wee-Chung Liew |
Graphics processing unit acceleration of the island model genetic algorithm using the CUDA programming platform. |
Concurr. Comput. Pract. Exp. |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Anton A. Raskovalov, Platon Surkov |
azTotMD 2.0: Molecular dynamics with the radiative thermostat and temperature-dependent force field (CUDA version). |
SoftwareX |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Younsang Cho, Jaeoh Kim, Donghyeon Yu |
Comparative Study of CUDA GPU Implementations in Python With the Fast Iterative Shrinkage-Thresholding Algorithm for LASSO. |
IEEE Access |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Hendrik Schwanekamp, Ramona Hohl, Dmitry Chirkin, Tom Gibbs, Alexander Harnisch, Claudio Kopper, Peter Messmer, Vishal Mehta, Alexander R. Olivas, Benedikt Riedel, Martin Rongen, David Schultz, Jakob van Santen |
Accelerating IceCube's Photon Propagation Code with CUDA. |
Comput. Softw. Big Sci. |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Mohsen Safari, Marieke Huisman |
Formal verification of parallel prefix sum and stream compaction algorithms in CUDA. |
Theor. Comput. Sci. |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Kaijie Huang, Jie Cao 0014 |
Multi-prediction metropolis hastings resampling filtering algorithm based on CUDA. |
Microprocess. Microsystems |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Hengliang Guo, Bowen Xu, Hong Yang, Bingyang Li, Yuanyuan Yue, Shan Zhao |
CUDA-based parallelization of time-weighted dynamic time warping algorithm for time series analysis of remote sensing data. |
Comput. Geosci. |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Lukas Spies, Amanda Bienz, J. David Moulton, Luke N. Olson, Andrew Reisner |
Tausch: A halo exchange library for large heterogeneous computing systems using MPI, OpenCL, and CUDA. |
Parallel Comput. |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Johannes Pekkilä, Miikka S. Väisälä, Maarit J. Käpylä, Matthias Rheinhardt, Oskar Lappi |
Scalable communication for high-order stencil computations using CUDA-aware MPI. |
Parallel Comput. |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Jack L. B. Line |
'WODEN': A CUDA-enabled package to simulate low-frequency radio interferometric data. |
J. Open Source Softw. |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Ruobing Han, Jaewon Lee, Jaewoong Sim, Hyesoon Kim |
COX : Exposing CUDA Warp-level Functions to CPUs. |
ACM Trans. Archit. Code Optim. |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Shaohui Xie, Xiaotian He, Shan He 0001, Zexuan Zhu |
CURC: a CUDA-based reference-free read compressor. |
Bioinform. |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Ruobing Han, Jun Chen, Bhanu Garg, Jeffrey Young 0001, Jaewoong Sim, Hyesoon Kim |
CuPBoP: CUDA for Parallelized and Broad-range Processors. |
CoRR |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Raymond Leung |
GPU implementation of a ray-surface intersection algorithm in CUDA (Compute Unified Device Architecture). |
CoRR |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Carl Pearson, Aurya Javeed, Karen D. Devine |
Machine Learning for CUDA+MPI Design Rules. |
CoRR |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Ke Yue, Nicholas Schwarz, Jonathan Z. Tischler |
Accelerating Laue Depth Reconstruction Algorithm with CUDA. |
CoRR |
2022 |
DBLP BibTeX RDF |
|
13 | Manuel Costanzo, Enzo Rucci, Carlos García Sánchez 0001, Marcelo R. Naiouf, Manuel Prieto-Matías |
Migrating CUDA to oneAPI: A Smith-Waterman Case Study. |
CoRR |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Wencheng Han, Hao Li 0009, Maoguo Gong, Jianzhao Li, Yiting Liu, Zhenkun Wang |
Multi-swarm particle swarm optimization based on CUDA for sparse reconstruction. |
Swarm Evol. Comput. |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Sorin Valcan, Mihail Gaianu |
CUDA Implementation For Eye Location On Infrared Images. |
Scalable Comput. Pract. Exp. |
2022 |
DBLP BibTeX RDF |
|
13 | Hao Yang, Shiyu Shen, Zhe Liu, Yunlei Zhao |
cuXCMP: CUDA-Accelerated Private Comparison Based on Homomorphic Encryption. |
IACR Cryptol. ePrint Arch. |
2022 |
DBLP BibTeX RDF |
|
13 | Shiyu Shen, Hao Yang, Yu Liu, Zhe Liu, Yunlei Zhao |
CUDA-Accelerated RNS Multiplication in Word-Wise Homomorphic Encryption Schemes. |
IACR Cryptol. ePrint Arch. |
2022 |
DBLP BibTeX RDF |
|
13 | Siva Kumar Pathuri, Neelamegam Anbazhagan, Gyanendra Prasad Joshi, Jinsang You |
Feature-Based Sentimental Analysis on Public Attention towards COVID-19 Using CUDA-SADBM Classification Model. |
Sensors |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Marc Jordà, Pedro Valero-Lara, Antonio J. Peña |
cuConv: CUDA implementation of convolution for CNN inference. |
Clust. Comput. |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Lesia Mochurad, Roman Bliakhar |
Comparison of the Efficiency of Parallel Algorithms KNN and NLM Based on CUDA for Large Image Processing. |
CMIS |
2022 |
DBLP BibTeX RDF |
|
13 | Khoa Ho, Hui Zhao 0013, Adwait Jog, Saraju P. Mohanty |
Improving GPU Throughput through Parallel Execution Using Tensor Cores and CUDA Cores. |
ISVLSI |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Simon B. Hengeveld, Antonio Mucherino |
A GPU approach to distance geometry in 1D: an implementation in C/CUDA. |
FedCSIS |
2022 |
DBLP DOI BibTeX RDF |
|
13 | David Defour |
Using scheduling entropy amplification in CUDA/OpenMP code to exhibit non-reproducibility issues. |
MCSoC |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Ajay Brahmakshatriya, Saman P. Amarasinghe |
GraphIt to CUDA Compiler in 2021 LOC: A Case for High-Performance DSL Implementation via Staging with BuilDSL. |
CGO |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Han Zhao 0005, Weihao Cui, Quan Chen 0002, Youtao Zhang, Yanchao Lu, Chao Li 0009, Jingwen Leng, Minyi Guo |
Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS. |
HPCA |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Hyuck Yi, SunHo Baek, JunSeong Kim |
Exploring Parallelism of a BRDF algorithm using CUDA. |
ICEIC |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Carl Pearson, Aurya Javeed, Karen D. Devine |
Machine Learning for CUDA+MPI Design Rules. |
IPDPS Workshops |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Vibhor Dodeja, Mohammad Almasri, Rakesh Nagi, Jinjun Xiong, Wen-Mei Hwu |
PARSEC: PARallel Subgraph Enumeration in CUDA. |
IPDPS |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Mark F. Adams, Dylan P. Brennan, Matthew G. Knepley, Peng Wang |
Landau collision operator in the CUDA programming model applied to thermal quench plasmas. |
IPDPS |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Zhiming Wang, Yury Plyakhin, Chenwei Sun, Ziran Zhang, Zhiwei Jiang, Andy Huang, Hao Wang |
A source-to-source CUDA to SYCL code migration tool: Intel® DPC++ Compatibility Tool. |
IWOCL |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Marcel Breyer, Alexander Van Craen, Dirk Pflüger |
A Comparison of SYCL, OpenCL, CUDA, and OpenMP for Massively Parallel Support Vector Machine Classification on Multi-Vendor Hardware. |
IWOCL |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Kathryn Fontaine, Ping Luo, Richard E. Carson |
Accelerating PET Image Reconstruction with CUDA. |
PEARC |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Vandana Bharti, Aryan Singhal, Anant Saxena, Bhaskar Biswas, Kaushal Kumar Shukla |
Parallelization of corner sort with CUDA for many-objective optimization. |
GECCO |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Chenyu Wang, Toshio Endo, Takahiro Hirofuchi, Tsutomu Ikegami |
Speed-up Single Shot Detector on GPU with CUDA. |
SNPD-Summer |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Raffaele Montella, Diana Di Luccio, Ciro Giuseppe De Vita, Gennaro Mellone, Marco Lapegna, Giuliano Laccetti, Sokol Kosta, Giulio Giunta |
Enabling the CUDA Unified Memory model in Edge, Cloud and HPC offloaded GPU kernels. |
CCGRID |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Bin Huang, Anjun Liu, Min Tian, Jingshan Pan, Yu Zhang |
Parallel Performance and Optimization of the Lattice Boltzmann Method Software Palabos Using CUDA. |
HP3C |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Imene Guerfi, Lobna Kriaa, Leïla Azouz Saïdane |
Towards Automatic Block Size Tuning for Image Processing Algorithms on CUDA. |
ICSOFT |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Manuel Costanzo, Enzo Rucci, Carlos García Sánchez 0001, Marcelo R. Naiouf, Manuel Prieto-Matías |
Migrating CUDA to oneAPI: A Smith-Waterman Case Study. |
IWBBIO (2) |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Dian-Lun Lin, Haoxing Ren, Yanqing Zhang 0002, Brucek Khailany, Tsung-Wei Huang |
From RTL to CUDA: A GPU Acceleration Flow for RTL Simulation with Batch Stimulus. |
ICPP |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Topi Miekkala, Matti Kutila, Mathias Schneider, Alfred Höß |
Optimizing 3D Object Detection for Embedded Systems in Automated Vehicles Using Sensor Data Fusion and CUDA Computing. |
ICCP |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Bernhard Kerbl, Michael Kenzel, Martin Winter, Markus Steinberger |
CUDA and Applications to Task-based Programming. |
Eurographics (Tutorials) |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Shang WanXin, Tao Wu 0010, Yang Fei, Xi Chen 0026, Jingjue Chen, Zhenxia Yu |
CUDA Acceleration of Worst-Case Execution Time Analysis Based On Model Checking. |
CBD |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Konrad Jalowiecki, Marek M. Rams, Bartlomiej Gardas |
Brute-forcing spin-glass problems with CUDA. |
Comput. Phys. Commun. |
2021 |
DBLP DOI BibTeX RDF |
|
13 | Sheng-Chun Yang, Yong-Lei Wang |
A hybrid MPI-CUDA approach for nonequispaced discrete Fourier transformation. |
Comput. Phys. Commun. |
2021 |
DBLP DOI BibTeX RDF |
|
13 | Stefan K. Muller, Jan Hoffmann 0002 |
Modeling and analyzing evaluation cost of CUDA kernels. |
Proc. ACM Program. Lang. |
2021 |
DBLP DOI BibTeX RDF |
|
13 | Xianyun Wu, Keyan Wang, Yunsong Li, Kai Liu 0021, Bormin Huang |
Accelerating Haze Removal Algorithm Using CUDA. |
Remote. Sens. |
2021 |
DBLP DOI BibTeX RDF |
|
13 | Edgar Josafat Martinez-Noriega, Syunji Yazaki, Tetsu Narumi |
CUDA offloading for energy-efficient and high-frame-rate simulations using tablets. |
Concurr. Comput. Pract. Exp. |
2021 |
DBLP DOI BibTeX RDF |
|
13 | Edoardo Coronado-Barrientos, Mario Antonioletti, Antonio J. García-Loureiro |
A new AXT format for an efficient SpMV product using AVX-512 instructions and CUDA. |
Adv. Eng. Softw. |
2021 |
DBLP DOI BibTeX RDF |
|
13 | Liang-Tsung Huang, Kai-Cheng Wei, Chao-Chin Wu, Chao-Yu Chen, Jian-An Wang |
A lightweight BLASTP and its implementation on CUDA GPUs. |
J. Supercomput. |
2021 |
DBLP DOI BibTeX RDF |
|
13 | R. Quintero-Monsebaiz, Amilcar Meneses-Viveros, F. Carranza, C. G. Cortés, A. González-Zamudio, A. Vela |
Multidimensional adaptative and deterministic integration in CUDA and OpenMP. |
J. Supercomput. |
2021 |
DBLP DOI BibTeX RDF |
|
13 | Leonardo Rundo, Andrea Tangherloni, Paolo Cazzaniga, Matteo Mistri, Simone Galimberti, Ramona Woitek, Evis Sala, Giancarlo Mauri, Marco S. Nobile |
A CUDA-powered method for the feature extraction and unsupervised analysis of medical images. |
J. Supercomput. |
2021 |
DBLP DOI BibTeX RDF |
|
13 | Benjamín Salomón Noyola-García, Suemi Rodríguez-Romo |
Simulations of Ga melting based on multiple-relaxation time lattice Boltzmann method performed with CUDA in Python. |
Math. Comput. Simul. |
2021 |
DBLP DOI BibTeX RDF |
|
13 | M. Ali Asan, Adnan Ozsoy |
cuRCD: Region covariance descriptor CUDA implementation. |
Multim. Tools Appl. |
2021 |
DBLP DOI BibTeX RDF |
|
13 | Jingrong Zhang, Zihao Wang, Zhiyong Liu 0002, Fa Zhang 0001 |
Improve the Resolution and Parallel Performance of the Three-Dimensional Refine Algorithm in RELION Using CUDA and MPI. |
IEEE ACM Trans. Comput. Biol. Bioinform. |
2021 |
DBLP DOI BibTeX RDF |
|
13 | Laura Antonelli, Elisa Francomano, Francesco Gregoretti |
A CUDA-based implementation of an improved SPH method on GPU. |
Appl. Math. Comput. |
2021 |
DBLP DOI BibTeX RDF |
|
13 | Songhai Fan, Yiyu Gong, Gexiang Zhang, Yun Xiao, Haina Rong, Prithwineel Paul, Xiaomin Ma, Han Huang, Marian Gheorghe 0001 |
Implementation of Kernel P Systems in CUDA for Solving NP-hard Problems. |
Int. J. Unconv. Comput. |
2021 |
DBLP BibTeX RDF |
|
13 | Luyan Liu, Zhengdong Zhang, Shuai Li 0001, Kai Ma 0002, Yefeng Zheng 0001 |
S-CUDA: Self-cleansing unsupervised domain adaptation for medical image segmentation. |
Medical Image Anal. |
2021 |
DBLP DOI BibTeX RDF |
|
13 | Xue Sun, Chao-Chin Wu, Yan-Fang Liu |
The Design and Implementation of an Improved Lightweight BLASTP on CUDA GPU. |
Symmetry |
2021 |
DBLP DOI BibTeX RDF |
|
13 | Bartosz Kohnke, Carsten Kutzner, Andreas Beckmann, Gert Lube, Ivo Kabadshow, Holger Dachsel, Helmut Grubmüller |
A CUDA fast multipole method with highly efficient M2L far field evaluation. |
Int. J. High Perform. Comput. Appl. |
2021 |
DBLP DOI BibTeX RDF |
|
13 | Nikolay Kondratyuk, Vsevolod P. Nikolskiy, Daniil Pavlov, Vladimir V. Stegailov |
GPU-accelerated molecular dynamics: State-of-art software performance and porting from Nvidia CUDA to AMD HIP. |
Int. J. High Perform. Comput. Appl. |
2021 |
DBLP DOI BibTeX RDF |
|
13 | Manuel Costanzo, Enzo Rucci, Carlos García Sánchez 0001, Marcelo R. Naiouf |
Early Experiences Migrating CUDA codes to oneAPI. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
13 | Ruobing Han, Blaise Tine, Jaewon Lee, Jaewoong Sim, Hyesoon Kim |
Supporting CUDA for an extended RISC-V GPU architecture. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
13 | Jiri Filipovic, Jana Hozzová, Amin Nezarat, Jaroslav Olha, Filip Petrovic |
Searching CUDA code autotuning spaces with hardware performance counters: data from benchmarks running on various GPU architectures. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
13 | Christopher A. Metz, Mehran Goli, Rolf Drechsler |
Pick the Right Edge Device: Towards Power and Performance Estimation of CUDA-based CNNs on GPGPUs. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
13 | Boitumelo Ruf, Jonas Mohrs, Martin Weinmann, Stefan Hinz, Jürgen Beyerer |
ReS2tAC - UAV-Borne Real-Time SGM Stereo Optimized for Embedded ARM and CUDA Devices. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
13 | Ruobing Han, Jaewon Lee, Jaewoong Sim, Hyesoon Kim |
COX: CUDA on X86 by Exposing Warp-Level Functions to CPUs. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
13 | Jan Novotný, Karel Adámek, Wes Armour |
Implementing CUDA Streams into AstroAccelerate - A Case Study. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
13 | Marc Jordà, Pedro Valero-Lara, Antonio J. Peña |
cuConv: A CUDA Implementation of Convolution for CNN Inference. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
13 | Swati Jindal, Xin Eric Wang |
CUDA-GR: Controllable Unsupervised Domain Adaptation for Gaze Redirection. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
13 | Guillermo Oyarzun, Daniel Mira, Guillaume Houzeaux |
Performance assessment of CUDA and OpenACC in large scale combustion simulations. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
13 | Patrick Diehl, Gregor Daiß, Dominic Marcello, Kevin A. Huck, Sagiv Shiber, Hartmut Kaiser, Juhan Frank, Dirk Pflüger |
Octo-Tiger's New Hydro Module and Performance Using HPX+CUDA on ORNL's Summit. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
Displaying results #301 - #400 of 1749 (100 per page).