Hits |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
16 | Yi-Min Wang, Hsiao-Hsi Wang, Ruei-Chuan Chang |
Classifying and alleviating the communication overheads in matrix computations on large-scale NUMA multiprocessors. |
J. Syst. Softw. |
1998 |
DBLP DOI BibTeX RDF |
|
16 | Yukio Ohishi, Keizo Saisho, Akira Fukuda |
Performance evaluation of Two-level Scheduling algorithms for NUMA multiprocessors. |
Syst. Comput. Jpn. |
1998 |
DBLP DOI BibTeX RDF |
|
16 | Edward D. Moreno, Sergio Takeo Kofuji |
Improvements on bus technology will affect the benefits of remote caches in CC-NUMA architectures. |
CATA |
1998 |
DBLP BibTeX RDF |
|
16 | Yunheung Paek, David A. Padua |
Experimental Study of Compiler Techniques for NUMA Machines. |
IPPS/SPDP |
1998 |
DBLP DOI BibTeX RDF |
|
16 | Hung-Chang Hsiao, Chung-Ta King |
Performance Evaluation of Cache Depot on CC-NUMA Multiprocessors. |
ICPADS |
1998 |
DBLP DOI BibTeX RDF |
|
16 | Robert Geist |
Performance Bounds for Modeling NUMA Architectures. |
Inf. Process. Lett. |
1997 |
DBLP DOI BibTeX RDF |
|
16 | David R. Kaeli, Liana L. Fong, Richard C. Booth, Kerry C. Imming, Joseph P. Weigel |
Performance analysis on a CC-NUMA prototype. |
IBM J. Res. Dev. |
1997 |
DBLP DOI BibTeX RDF |
|
16 | Guan-Joe Lai, Cheng Chen |
Scheduling Parallel Program Tasks with Non-negligible Intertask Communications onto NUMA Multiprocessor Systems. |
Parallel Algorithms Appl. |
1997 |
DBLP DOI BibTeX RDF |
|
16 | Sakti Pramanik, Walid R. Tout |
The NUMA with Clusters of Processors for Parallel Join. |
IEEE Trans. Knowl. Data Eng. |
1997 |
DBLP DOI BibTeX RDF |
shared-everything architecture, load-balancing, parallel processing, shared-nothing architecture, Relational join |
16 | Yi-Min Wang, Hsiao-Hsi Wang, Ruei-Chuan Chang |
Clustered affinity scheduling on large-scale NUMA multiprocessors. |
J. Syst. Softw. |
1997 |
DBLP DOI BibTeX RDF |
|
16 | Sivarama P. Dandamudi, Philip S. P. Cheng |
Performance impact of run queue organization and synchronization on large-scale NUMA multiprocessor systems. |
J. Syst. Archit. |
1997 |
DBLP DOI BibTeX RDF |
|
16 | Marcus Dormanns, Walter Sprangers, Hubert Ertl, Thomas Bemmerl |
A Programming Interface for NUMA Shared-Memory Clusters. |
HPCN |
1997 |
DBLP DOI BibTeX RDF |
|
16 | James Westall, Robert Geist |
A Hybrid Tool for the Performance Evaluation of NUMA Architectures. |
WSC |
1997 |
DBLP DOI BibTeX RDF |
|
16 | Radhika Thekkath, Amit Pal Singh, Jaswinder Pal Singh, Susan John, John L. Hennessy |
An Evaluation of a Commercial CC-NUMA Architecture - The CONVEX Exemplar SPP1200. |
IPPS |
1997 |
DBLP DOI BibTeX RDF |
|
16 | Luc Bouganim, Daniela Florescu, Patrick Valduriez |
Multi-Join Query Execution with Skew in NUMA Multiprocessors. |
BDA |
1997 |
DBLP BibTeX RDF |
|
16 | David Xiaowei Wang |
New Scalable Parallel Computer Architecture - Non-Uniform Memory Access (NUMA-Q). |
PDPTA |
1997 |
DBLP BibTeX RDF |
|
16 | Tim Brecht |
An Experimental Evaluation of Processor Pool-Based Scheduling for Shared-Memory NUMA Multiprocessors. |
JSSPP |
1997 |
DBLP DOI BibTeX RDF |
|
16 | Yan Yang Xiao, John K. Bennett |
Memory Organization in Multi-Channel Optical Networks: NUMA and COMA Revisited. |
International Conference on Supercomputing |
1996 |
DBLP DOI BibTeX RDF |
|
16 | Jarek Nieplocha, Robert J. Harrison |
Shared Memory NUMA Programming on I-WAY. |
HPDC |
1996 |
DBLP DOI BibTeX RDF |
|
16 | Tarek S. Abdelrahman, Kenneth L. Ma |
Evaluation of Dynamic Data Distributions on NUMA Shared Memory Multiprocessors. |
PDPTA |
1996 |
DBLP BibTeX RDF |
|
16 | Jong Woo Lee, Yookun Cho |
Using Adjustable DELAY Counter for Page Replication in NUMA Multiprocessors. |
Parallel and Distributed Computing and Systems |
1995 |
DBLP BibTeX RDF |
|
16 | Isabel Abranches Viegas, Rui da Silva Marques |
A Qualidade de Serviço: Um Conceito Global numa Empresa de Comunicações Globais. |
QUATIC |
1995 |
DBLP BibTeX RDF |
|
16 | Feixiong Liu, Thomas Peikenkamp, Werner Damm |
An Extended Gradient Model for NUMA Multiprocessor Systems. |
ASIAN |
1995 |
DBLP DOI BibTeX RDF |
|
16 | Kenneth C. Sevcik, Songnian Zhou |
Performance Benefits and Limitations of Large NUMA Multiprocessors. |
Perform. Evaluation |
1994 |
DBLP DOI BibTeX RDF |
|
16 | Xiaodong Zhang 0001, Yong Yan 0003 |
Modeling Data Migration on CC-NUMA and CC-COMA Hierarchical Ring Architectures. |
MASCOTS |
1994 |
DBLP DOI BibTeX RDF |
|
16 | Karim Harzallah, Kenneth C. Sevcik |
Evaluating Memory System Performance of a Large-Scale NUMA Multiprocessor. |
MASCOTS |
1994 |
DBLP DOI BibTeX RDF |
|
16 | Ronald C. Unrau, Orran Krieger, Benjamin Gamsa, Michael Stumm |
Experiences with Locking in a NUMA Multiprocessor Operating System Kernel. |
OSDI |
1994 |
DBLP BibTeX RDF |
|
16 | Santosh Pande, Kleanthis Psarris |
A Compilation Technique for Varying Communication Cost NUMA Architectures. |
PARLE |
1994 |
DBLP DOI BibTeX RDF |
|
16 | Xiaodong Zhang 0001, Yong Yan 0003 |
Latency Analysis of CC-NUMA and CC-COMA Rings. |
ICPP (1) |
1994 |
DBLP DOI BibTeX RDF |
|
16 | Timothy B. Brecht |
Multiprogrammed parallel application scheduling in NUMA multiprocessors. |
|
1994 |
RDF |
|
16 | Wei Li 0015, Keshav Pingali |
Access Normalization: Loop Restructuring for NUMA Compilers. |
ACM Trans. Comput. Syst. |
1993 |
DBLP DOI BibTeX RDF |
nonsingular loop transformation, nonuniform memory access machines, parallelizing compilers, data locality, loop transformation |
16 | Richard Wolski, John Feo |
Program Partitioning for NUMA Multiprocessor Computer Systems. |
J. Parallel Distributed Comput. |
1993 |
DBLP DOI BibTeX RDF |
|
16 | Akira Fukuda, Ryousuke Fujiki, Hisa-aki Kai |
Two-level processor scheduling for multiprogrammed NUMA multiprocessors. |
COMPSAC |
1993 |
DBLP DOI BibTeX RDF |
|
16 | Dannie Durand, Thierry Montaut, Lionel Kervella, William Jalby |
Impact of Memory Contention on Dynamic Scheduling on NUMA Multiprocessors. |
ICPP (1) |
1993 |
DBLP DOI BibTeX RDF |
|
16 | Hui Li, Sudarsan Tandri, Michael Stumm, Kenneth C. Sevcik |
Locality and Loop Scheduling on NUMA Multiprocessors. |
ICPP (2) |
1993 |
DBLP DOI BibTeX RDF |
|
16 | Wei Li 0015 |
Compiling for NUMA Parallel Machines. |
|
1993 |
RDF |
|
16 | Wei Li 0015, Keshav Pingali |
Loop Transformations for NUMA Machines. |
SIGPLAN Workshop |
1992 |
DBLP DOI BibTeX RDF |
|
16 | Jayashree Ramanathan, Lionel M. Ni |
Exploiting Data Exchange Patterns in Creating Objects for NUMA Shared Virtual Memory Systems. |
ICPP (2) |
1992 |
DBLP BibTeX RDF |
|
16 | Per Stenström, Truman Joe, Anoop Gupta |
Comparative Performance Evaluation of Cache-Coherent NUMA and COMA Architectures. |
ISCA |
1992 |
DBLP DOI BibTeX RDF |
|
16 | Wei Li 0015, Keshav Pingali |
Access Normalization: Loop Restructuring for NUMA Compilers. (long version: TOCS 11(4): 353-375) |
ASPLOS |
1992 |
DBLP DOI BibTeX RDF |
|
16 | Richard P. LaRowe Jr., Carla Schlatter Ellis |
Experimental Comparison of Memory Management Policies for NUMA Multiprocessors. |
ACM Trans. Comput. Syst. |
1991 |
DBLP DOI BibTeX RDF |
|
16 | Richard P. LaRowe Jr., Carla Schlatter Ellis |
Page Placement Policies for NUMA Multiprocessors. |
J. Parallel Distributed Comput. |
1991 |
DBLP DOI BibTeX RDF |
|
16 | Shyam Mudambi |
Performances of Aurora on NUMA Machines. |
ICLP |
1991 |
DBLP BibTeX RDF |
|
16 | Xiaodong Zhang 0001 |
Dynamic and static load scheduling performance on a NUMA shared memory multiprocessor. |
ICS |
1991 |
DBLP DOI BibTeX RDF |
|
16 | Richard P. LaRowe Jr., James T. Wilkes, Carla Schlatter Ellis |
Exploiting Operating System Support for Dynamic Page Placement on a NUMA Shared Memory Multiprocessor. |
PPoPP |
1991 |
DBLP DOI BibTeX RDF |
|
16 | William J. Bolosky, Michael L. Scott, Robert P. Fitzgerald, Robert J. Fowler, Alan L. Cox |
NUMA Policies and Their Relation to Memory Architecture. |
ASPLOS |
1991 |
DBLP DOI BibTeX RDF |
|
16 | William J. Bolosky, Robert P. Fitzgerald, Michael L. Scott |
Simple But Effective Techniques for NUMA Memory Management. |
SOSP |
1989 |
DBLP DOI BibTeX RDF |
|
12 | Steven A. Hofmeyr, Costin Iancu, Filip Blagojevic |
Load balancing on speed. |
PPoPP |
2010 |
DBLP DOI BibTeX RDF |
load balancing, operating systems, parallel applications |
12 | Cosmin E. Oancea, Alan Mycroft, Stephen M. Watt |
A new approach to parallelising tracing algorithms. |
ISMM |
2009 |
DBLP DOI BibTeX RDF |
memory-centric tracing algorithm, parallel |
12 | John L. Henning |
SPECrate2006: Alternatives Considered, Lessons Learned. |
SPEC Benchmark Workshop |
2009 |
DBLP DOI BibTeX RDF |
|
12 | Huandong Wang, Dan Tang, Xiang Gao, Yunji Chen |
An Enhanced HyperTransport Controller with Cache Coherence Support for Multiple-CMP. |
NAS |
2009 |
DBLP DOI BibTeX RDF |
|
12 | Brice Goglin, Nathalie Furmento |
Enabling high-performance memory migration for multithreaded applications on LINUX. |
IPDPS |
2009 |
DBLP DOI BibTeX RDF |
|
12 | Krishna Chaitanya Kandalla, Hari Subramoni, Gopalakrishnan Santhanaraman, Matthew J. Koop, Dhabaleswar K. Panda 0001 |
Designing multi-leader-based Allgather algorithms for multi-core clusters. |
IPDPS |
2009 |
DBLP DOI BibTeX RDF |
|
12 | Vasileios Karakasis, Georgios I. Goumas, Nectarios Koziris |
Exploring the effect of block shapes on the performance of sparse kernels. |
IPDPS |
2009 |
DBLP DOI BibTeX RDF |
|
12 | Gilbert Accary, Oleg Bessonov, Dominique Fougère, Konstantin Gavrilov, Sofiane Meradji, Dominique Morvan |
Efficient Parallelization of the Preconditioned Conjugate Gradient Method. |
PaCT |
2009 |
DBLP DOI BibTeX RDF |
|
12 | Stavros Passas, Kostas Magoutis, Angelos Bilas |
Towards 100 gbit/s ethernet: multicore-based parallel communication protocol design. |
ICS |
2009 |
DBLP DOI BibTeX RDF |
100 gbit/s ethernet, communication protocol design, multicore cpus, performance evaluation |
12 | Subhash Saini, Andrey Naraikin, Rupak Biswas, David Barkai, Timothy Sandstrom |
Early performance evaluation of a "Nehalem" cluster using scientific and engineering applications. |
SC |
2009 |
DBLP DOI BibTeX RDF |
|
12 | Haris Volos 0001, Andres Jaan Tack, Neelam Goyal, Michael M. Swift, Adam Welc |
xCalls: safe I/O in memory transactions. |
EuroSys |
2009 |
DBLP DOI BibTeX RDF |
xCalls, transactional memory, concurrent programming, I/O, system calls |
12 | Greg Bronevetsky, John C. Gyllenhaal, Bronis R. de Supinski |
CLOMP: Accurately Characterizing OpenMP Application Overheads. |
IWOMP |
2008 |
DBLP DOI BibTeX RDF |
|
12 | Wanxia Qu, Yang Guo 0003, Zhengbin Pang, Xiaodong Yang |
Efficient Verification of Parameterized Cache Coherence Protocols. |
ICYCS |
2008 |
DBLP DOI BibTeX RDF |
|
12 | Christophe Jaillet, Michaël Krajecki |
A New Memory Allocation Model for Parallel Search Space Data Structures with OpenMP. |
IWOMP |
2007 |
DBLP DOI BibTeX RDF |
shared memory, OpenMP, Parallel systems, memory allocation |
12 | Hassan Chafi, Jared Casper, Brian D. Carlstrom, Austen McDonald, Chi Cao Minh, Woongki Baek, Christos Kozyrakis, Kunle Olukotun |
A Scalable, Non-blocking Approach to Transactional Memory. |
HPCA |
2007 |
DBLP DOI BibTeX RDF |
|
12 | Everton Carara, Aline Mello 0001, Fernando Moraes 0001 |
Communication Models in Networks-on-Chip. |
IEEE International Workshop on Rapid System Prototyping |
2007 |
DBLP DOI BibTeX RDF |
|
12 | Michele Pittau, Andrea Alimonda, Salvatore Carta, Andrea Acquaviva |
Impact of Task Migration on Streaming Multimedia for Embedded Multiprocessors: A Quantitative Evaluation. |
ESTIMedia |
2007 |
DBLP DOI BibTeX RDF |
|
12 | Jeff Gilchrist, Aysegul Cuhadar |
Parallel Lossless Data Compression Based on the Burrows-Wheeler Transform. |
AINA |
2007 |
DBLP DOI BibTeX RDF |
|
12 | Hong Ong, Jeffrey S. Vetter, R. Scott Studham, Collin McCurdy, Bruce Walker, Alan L. Cox |
Kernel-level single system image for petascale computing. |
ACM SIGOPS Oper. Syst. Rev. |
2006 |
DBLP DOI BibTeX RDF |
SSI, scalability, availability, kernel |
12 | Bianca Schroeder, Garth A. Gibson |
A large-scale study of failures in high-performance computing systems. |
DSN |
2006 |
DBLP DOI BibTeX RDF |
|
12 | Siham Tabik, Ester M. Garzón, Inmaculada García, José-Jesús Fernández |
Evaluation of Parallel Paradigms on Anisotropic Nonlinear Diffusion. |
Euro-Par |
2006 |
DBLP DOI BibTeX RDF |
|
12 | Victor Luchangco, Daniel Nussbaum, Nir Shavit |
A Hierarchical CLH Queue Lock. |
Euro-Par |
2006 |
DBLP DOI BibTeX RDF |
|
12 | Amith R. Mamidala, Lei Chai, Hyun-Wook Jin, Dhabaleswar K. Panda 0001 |
Efficient SMP-aware MPI-level broadcast over InfiniBand's hardware multicast. |
IPDPS |
2006 |
DBLP DOI BibTeX RDF |
|
12 | Ahmed Abdelkhalek 0002, Tarek S. Abdelrahman |
Locality management using multiple SPMs on the Multi-Level Computing Architecture. |
ESTIMedia |
2006 |
DBLP DOI BibTeX RDF |
|
12 | Ralf Gruber, Vincent Keller, Emmanuel Leriche, Marc-Antoine Habisreutinger |
Can a Helmholtz solver run on a cluster? |
CLUSTER |
2006 |
DBLP DOI BibTeX RDF |
|
12 | Juan Rubio 0001, Lizy Kurian John |
Reducing Server Data Traffic Using a Hierarchical Computation Model. |
IEEE Trans. Parallel Distributed Syst. |
2005 |
DBLP DOI BibTeX RDF |
I/O interconnections topology, modeling, evaluation, databases, measurement, Distributed architectures, simulation of multiple-processor systems |
12 | Constantine Katsinis |
Block Migration in Broadcast-based Multiprocessor Architectures. |
NCA |
2005 |
DBLP DOI BibTeX RDF |
|
12 | Alberto Ros 0001, Manuel E. Acacio, José M. García 0001 |
A Novel Lightweight Directory Architecture for Scalable Shared-Memory Multiprocessors. |
Euro-Par |
2005 |
DBLP DOI BibTeX RDF |
|
12 | Thomas H. Dunigan, Jeffrey S. Vetter, Patrick H. Worley |
Performance Evaluation of the SGI Altix 3700. |
ICPP |
2005 |
DBLP DOI BibTeX RDF |
|
12 | Julian Borrill, Jonathan Carter, Leonid Oliker, David Skinner, Rupak Biswas |
Integrated Performance Monitoring of a Cosmology Application on Leading HEC Platforms. |
ICPP |
2005 |
DBLP DOI BibTeX RDF |
Cosmic Microwave Background, MADCAP, Altix Columbia, Earth Simulator, X1 Phoenix, Power3 Seaborg, parallel performance characterization |
12 | K. Korotaev |
Hierarchical CPU Schedulers for Multiprocessor Systems, Fair CPU Scheduling and Processes Isolation. |
CLUSTER |
2005 |
DBLP DOI BibTeX RDF |
|
12 | Manuel E. Acacio, José González 0002, José M. García 0001, José Duato |
An Architecture for High-Performance Scalable Shared-Memory Multiprocessors Exploiting On-Chip Integration. |
IEEE Trans. Parallel Distributed Syst. |
2004 |
DBLP DOI BibTeX RDF |
|
12 | Juan Carlos Pichel, Dora Blanco Heras, José Carlos Cabaleiro, Francisco F. Rivera |
Improving the Locality of the Sparse Matrix-Vector Product on Shared Memory Multiprocessors. |
PDP |
2004 |
DBLP DOI BibTeX RDF |
|
12 | Kiyofumi Tanaka, Toshihide Hagiwara |
A Scalable and Adaptive Directory Scheme for Hardware Distributed Shared Memory. |
Asia-Pacific Computer Systems Architecture Conference |
2004 |
DBLP DOI BibTeX RDF |
|
12 | John P. Sustersic, Ali R. Hurson |
A Quality of Service (QoS) Implementation of Internet Cache Coherence. |
AINA (1) |
2004 |
DBLP DOI BibTeX RDF |
|
12 | Angelos Bilas, Courtney R. Gibson, Reza Azimi, Rosalia Christodoulopoulou, Peter Jamieson |
Using System Emulation to Model Next-Generation Shared Virtual Memory Clusters. |
Clust. Comput. |
2003 |
DBLP DOI BibTeX RDF |
high-bandwidth interconnects, distributed shared memory, parallel systems, clusters of workstations, low-latency |
12 | Sung Woo Chung, Hyong-Shik Kim, Chu Shik Jhon |
Distance-aware L2 Cache Organizations for Scalable Multiprocessor Systems. |
DSD |
2003 |
DBLP DOI BibTeX RDF |
|
12 | Martin Schulz 0001, Sally A. McKee |
A Framework for Portable Shared Memory Programming. |
IPDPS |
2003 |
DBLP DOI BibTeX RDF |
|
12 | Julita Corbalán, Xavier Martorell, Jesús Labarta |
Evaluation of the memory page migration influence in the system performance: the case of the SGI O2000. |
ICS |
2003 |
DBLP DOI BibTeX RDF |
memory page migrations, performance evaluation, operating systems, scheduling algorithms |
12 | Prasad Jayanti |
Adaptive and efficient abortable mutual exclusion. |
PODC |
2003 |
DBLP DOI BibTeX RDF |
|
12 | Pawel Hajto, Marcin Skrzypek |
Wavelet-Neuronal Resource Load Prediction for Multiprocessor Environment. |
PPAM |
2003 |
DBLP DOI BibTeX RDF |
|
12 | Yunheung Paek, Angeles G. Navarro, Emilio L. Zapata, Jay P. Hoeflinger, David A. Padua |
An Advanced Compiler Framework for Non-Cache-Coherent Multiprocessors. |
IEEE Trans. Parallel Distributed Syst. |
2002 |
DBLP DOI BibTeX RDF |
array privatization, noncoherent caches, Put/Get, compiler, multiprocessors, dependence analysis, shared-memory programming |
12 | Valentin Puente, José A. Gregorio, Ramón Beivide |
SICOSYS: An Integrated Framework for studying Interconnection Network Performance in Multiprocessor Systems. |
PDP |
2002 |
DBLP DOI BibTeX RDF |
|
12 | Marcelo H. Cintra, Josep Torrellas |
Speculative Multithreading Eliminating Squashes through Learning Cross-Thread Violations in Speculative Parallelization for Multiprocessors. |
HPCA |
2002 |
DBLP DOI BibTeX RDF |
Shared-Memory Multiprocessors, Speculative Parallelization |
12 | Gavril Godza, Valentin Cristea |
Comparative Study of COW and SMP Computer Configurations. |
PARELEC |
2002 |
DBLP DOI BibTeX RDF |
genetic algorithms, parallel computers, distributed algorithms, evolutionary computing, message passing |
12 | Rolf Rabenseifner |
Communication Bandwidth of Parallel Programming Models on Hybrid Architectures. |
ISHPC |
2002 |
DBLP DOI BibTeX RDF |
Threads and MPI, MPI, OpenMP, HPC, Hybrid Parallel Programming |
12 | R. M. Schoemaker, P. C. A. de Haas, H. J. H. Clercx, Robert M. M. Mattheij |
Contour Dynamics Simulations with a Parallel Hierarchical-Element Method. |
International Conference on Computational Science (1) |
2002 |
DBLP DOI BibTeX RDF |
|
12 | Jie Tao 0001, Martin Schulz 0001, Wolfgang Karl |
Improving Data Locality Using Dynamic Page Migration Based on Memory Access Histograms. |
International Conference on Computational Science (2) |
2002 |
DBLP DOI BibTeX RDF |
|
12 | Prasad Jayanti |
f-arrays: implementation and applications. |
PODC |
2002 |
DBLP DOI BibTeX RDF |
|
12 | Patrick Keane, Mark Moir |
A Simple Local-Spin Group Mutual Exclusion Algorithm. |
IEEE Trans. Parallel Distributed Syst. |
2001 |
DBLP DOI BibTeX RDF |
scalable, synchronization, shared memory, Mutual exclusion, local spinning |
12 | Manuel E. Acacio, José González 0002, José M. García 0001, José Duato |
A New Scalable Directory Architecture for Large-Scale Multiprocessors. |
HPCA |
2001 |
DBLP DOI BibTeX RDF |
|
12 | Barbara M. Chapman, Oscar R. Hernandez, Amit Patil, Achal Prabhakar |
Program Development Environment for OpenMP Programs on ccNUMA Architectures. |
LSSC |
2001 |
DBLP DOI BibTeX RDF |
ccNUMA architectures, programming environments, OpenMP, data distribution, data locality, restructuring, software distributed shared memory, shared memory parallel programming |
12 | Josef Weidendorfer, Peter Luksch 0001 |
A Framework for Transparent Load Balancing in Parallel Numerical Simulation. |
Annual Simulation Symposium |
2001 |
DBLP DOI BibTeX RDF |
|