Richard Vuduc's Publications

Assistant Professor, Georgia Institute of Technology
Atlanta, Georgia, United States

Research field: Computer and Information Science
High-performance computing, parallel algorithms and programming models, automated performance tuning (autotuning), debugging
Report (3) | Thesis (1) | Conference Proceedings (39) | Journal Article (9)

Conference Proceedings

Kent Czechowski, Casey Battaglino, Chris McClanahan, Aparna Chandramowlishwaran, Richard Vuduc (2011) Balance principles for algorithm-architecture co-design, 1-5. In USENIX Wkshp. Hot Topics in Parallelism (HotPar).
Raghul Gunasekaran, David Dillow, Galen Shipman, Richard Vuduc, Edmond Chow (2011) Characterizing Application Runtime Behavior from System Logs and Metrics. In Proceedings of the 1st International Workshop on Characterizing Applications for Heterogeneous Exascale Systems (CACHES).
Aparna Chandramowlishwaran, Kathleen Knobe, Richard Vuduc (2010) Applying the Concurrent Collections programming model to asynchronous parallel dense linear algebra. In ACM SIGPLAN Symp. Principles and Practice of Parallel Programming (PPoPP).
Aparna Chandramowlishwaran, Kamesh Madduri, Richard Vuduc (2010) Diagnosis, Tuning, and Redesign for Multicore Performance: A Case Study of the Fast Multipole Method. In ACM/IEEE Conf. Supercomputing (SC).
Sangmin Park, Richard W Vuduc, Mary Jean Harrold (2010) Falcon: Fault localization for concurrent programs. In Proceedings of the 32nd ACM/IEEE International Conference on Software Engineering - ICSE '10.
Jaekyu Lee, Nagesh B Lakshminarayana, Hyesoon Kim, Richard Vuduc (2010) Hardware and software prefetching mechanisms for GPGPU applications. In IEEE/ACM Int'l. Symp. Microarchitecture (MICRO).
Jee W. Choi, Amik Singh, Richard W. Vuduc (2010) Model-driven autotuning of sparse matrix-vector multiply on GPUs. In Proceedings of the 15th ACM SIGPLAN symposium on Principles and practice of parallel programming - PPoPP '10.
Richard Vuduc, Aparna Chandramowlishwaran, Jee Choi, M. Guney, A. Shringarpure (2010) On the limits of GPU acceleration. In USENIX Wkshp. Hot Topics in Parallelism (HotPar).
Aparna Chandramowlishwaran, Samuel Williams, Leonid Oliker, Ilya Lashuk, George Biros, Richard Vuduc (2010) Optimizing and tuning the fast multipole method for state-of-the-art multicore architectures, 1-12. In 2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS).
Aparna Chandramowlishwaran, Kathleen Knobe, Richard Vuduc (2010) Performance evaluation of concurrent collections on high-performance multicore computing systems, 1-12. In 2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS).
Abtin Rahimian, Ilya Lashuk, Shravan K Veerapaneni, Aparna Chandramowlishwaran, Dhairya Malhotra, Logan Moon, Rahul Sampath, Aashay Shringarpure, Jeffrey Vetter, Richard Vuduc, Denis Zorin, George Biros (2010) Petascale direct numerical simulation of blood flow on 200K cores and heterogeneous architectures. In ACM/IEEE Conf. Supercomputing (SC).
Sooraj Bhat, Ashish Agarwal, Alexander Gray, Richard Vuduc (2010) Toward interactive statistical modeling, 1835-1844. In Procedia Computer Science, International Conference on Computational Science (ICCS) 1 (1).
Ilya Lashuk, George Biros, Aparna Chandramowlishwaran, Harper Langston, Tuan-anh Nguyen, Rahul Sampath, Aashay Shringarpure, Richard Vuduc, Lexing Ying, Denis Zorin (2009) A massively parallel adaptive fast-multipole method on heterogeneous architectures. In Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis - SC '09.
Nitin Arora, Aashay Shringarpure, Richard W Vuduc (2009) Direct n-body kernels for multicore platforms, 379-387. In 2009 International Conference on Parallel Processing.
Chunhua Liao, Daniel J Quinlan, Richard Vuduc, Thomas Panas (2009) Effective source-to-source outlining to support whole program empirical optimization. In Int'l. Wkshp. Languages and Compilers for Parallel Computing (LCPC).
Nitin Arora, Ryan P Russell, Richard W Vuduc (2009) Fast sensitivity computations for numerical optimizations. In Proc. AAS/AIAA Astrodynamics Specialist Conference.
Manisha Gajbe, Andrew Canning, John Shalf, Lin-Wang Wang, Harvey Wasserman, Richard Vuduc (2009) Optimization and auto-tuning of 3D FFTs on the Cray XT4. In Proc. Cray User's Group (CUG) Meeting.
Sundaresan Venkatasubramanian, Richard W Vuduc (2009) Tuned and wildly asynchronous stencil kernels for hybrid CPU/GPU systems. In Proceedings of the 23rd International Conference on Supercomputing - ICS '09.
Seunghwa Kang, David A Bader, Richard Vuduc (2009) Understanding the design trade-offs among current multicore systems for numerical computations, 1-12. In 2009 IEEE International Symposium on Parallel & Distributed Processing.
T Panas, D Quinlan, R Vuduc (2007) Analyzing and Visualizing Whole Program Architectures. In Wkshp. Aerospace Software Engineering (AeroSE), at ACM/IEEE Int'l. Conf. Software Eng. (ICSE).
Samuel Williams, Leonid Oliker, Richard Vuduc, John Shalf, Katherine Yelick, James Demmel (2007) Optimization of sparse matrix-vector multiplication on emerging multicore platforms. In Proceedings of the 2007 ACM/IEEE conference on Supercomputing - SC '07.
D.J. Quinlan, R.W. Vuduc, Ghassan Misherghi (2007) Techniques for specifying bug patterns, 27–35. In Proceedings of the 2007 ACM workshop on Parallel and distributed systems: testing and debugging.
Thomas Panas, Dan Quinlan, Richard Vuduc (2007) Tool Support for Inspecting the Code Quality of HPC Applications. In Third International Workshop on Software Engineering for High Performance Computing Applications (SE-HPC '07).
Qing Yi, Keith Seymour, Haihang You, Richard Vuduc, Dan Quinlan (2007) POET: Parameterized Optimizations for Empirical Tuning, 1-8. In Wkshp. Performance Optimization of High-level Languages and Libraries (POHLL), at IEEE Int'l. Par. Distrib. Processing Symp. (IPDPS).
Dan Quinlan, Markus Schordan, Richard Vuduc, Qing Yi (2006) Annotating User-Defined Abstractions for Optimization, 1-8. In Proceedings 20th IEEE International Parallel & Distributed Processing Symposium.
Richard Vuduc, Martin Schulz, Dan Quinlan, Bronis de Supinski, Andreas Sæbjørnsen (2006) Improving distributed memory applications testing by message perturbation. In Proceedings of the 2006 workshop on Parallel and distributed systems: testing and debugging - PADTAD '06.
Dan Quinlan, Richard Vuduc, Thomas Panas, Jochen Härdtlein, Andreas Sæbjørnsen (2006) Support for whole-program analysis and the verification of the one-definition rule in C++, 27-35. In Static Analysis Summit (SAS).
Dan Quinlan, Shmuel Ur, Richard Vuduc (2005) An extensible open-source compiler infrastructure for testing, 116-133. In IBM Haifa Verification Conf. (VC).
Richard W Vuduc (Lawrence Livermore National Laboratory), Hyun-jin Moon (University of California, Los Angeles) (2005) Fast sparse matrix-vector multiplication by exploiting variable block structure, 807-816. In High-Performance Computing and Communications Conf. (HPCC).
BC Lee, RW Vuduc, JW Demmel, K.A. Yelick (2004) Performance models for evaluation and automatic tuning of symmetric sparse matrix-vector multiply, 169-176 vol.1. In International Conference on Parallel Processing, 2004. ICPP 2004.
R Vuduc, A Gyulassy, J. Demmel, K. Yelick (2003) Memory Hierarchy Optimizations and Performance Bounds for Sparse A^T A x, 704–705. In Wkshp. Parallel Linear Algebra (PLA), at Int'l. Conf. Computational Sci. (ICCS).
R Vuduc, S Kamil, J Hsu, R Nishtala, J W Demmel, K A Yelick (2002) Automatic performance tuning and analysis of sparse triangular solve. In Wkshp. Performance Optimization of High-level Languages and Libraries (POHLL), at ACM Int'l. Conf. Supercomputing (ICS).
R Vuduc, J.W. Demmel, K.A. Yelick, S Kamil, R Nishtala, B Lee (2002) Performance optimizations and bounds for sparse matrix-vector multiply. In ACM/IEEE Conf. Supercomputing (SC).
Richard Vuduc, James W Demmel, Jeff A Bilmes (2001) Statistical models for empirical search-based performance tuning, 117-126. In Int'l. Conf. Computational Science (ICCS).
Richard Vuduc, James W Demmel (2000) Code generators for automatic tuning of numerical kernels: Experiences with FFTW. In Wkshp. Semantics, Applications, and Implementation of Program Generation (SAIG), at ACM SIGPLAN Conf. Functional Programming (ICFP).
Richard Vuduc, James Demmel, Jeff Bilmes (2000) Statistical modeling of feedback data in an automatic tuning system. In MICRO-33: Third ACM Workshop on Feedback-Directed Dynamic Optimization.
Danyel Fisher, Kris Hildrum, Jason Hong, Mark Newman, Megan Thomas, Rich Vuduc (2000) SWAMI: A framework for collaborative filtering algorithm development and evaluation, 366-368. In Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '00.
Bohdan Balko, Irvin Kay, Richard Vuduc, John Neuberger (1996) An investigation of the possible enhancement of nuclear superfluorescence, 308. In Lasers '95.
Aparna Chandramowlishwaran, Abhinav Kahru, Ketan Umare, Richard Vuduc Numerical algorithms with tunable parallelism. In Wkshp. Software Tools for Multicore Systems (STMCS), at IEEE/ACM Int'l. Symp. Code Generation and Optimization (CGO).