Richard W. Vuduc, Ph.D.

Affiliations: 
1997-2004 Computer Science Division, University of California, Berkeley, Berkeley, CA, United States
Area:
High-performance computing, performance engineering, autotuning
Website:
https://vuduc.org

Parents

James W. Demmel grad student 1997-2004 UC Berkeley
 (Automatic performance tuning of sparse matrix kernels.)
Katherine A. Yelick grad student 1997-2004 UC Berkeley (Computer Science Tree)
 (Automatic performance tuning of sparse matrix kernels.)

Children

Aparna Chandramowlishwaran grad student 2008-2013 Georgia Tech
Cong Hou grad student 2009-2013 Georgia Tech
Sang Min Park grad student 2008-2014 Georgia Tech
Jee Whan Choi grad student 2008-2015 Georgia Tech
Mohammad M. Hossain grad student 2014-2016 Georgia Tech
Piyush Kumar Sao grad student 2011-2018 Georgia Tech
Jiajia Li grad student 2015-2018 Georgia Tech
Kenneth Czechowski grad student 2011-2019 Georgia Tech
Marat Dukhan grad student 2012-2021 Georgia Tech

Publications

Li Z, Jia H, Zhang Y, et al. (2020) Automatic Generation of High-Performance FFT Kernels on Arm and X86 CPUs. IEEE Transactions on Parallel and Distributed Systems. 31: 1925-1941
Sao P, Li XS, Vuduc R. (2019) A communication-avoiding 3D algorithm for sparse LU factorization on heterogeneous systems. Journal of Parallel and Distributed Computing. 131: 218-234
Ma Y, Li J, Wu X, et al. (2019) Optimizing sparse tensor times matrix on GPUs. Journal of Parallel and Distributed Computing. 129: 99-109
Hossain MM, Nath C, Tucker TM, et al. (2018) A Graphics Processor Unit-Accelerated Freeform Surface Offsetting Method for High-Resolution Subtractive Three-Dimensional Printing (Machining). Journal of Manufacturing Science and Engineering. 140
Du Z, Ge R, Lee VW, et al. (2017) Modeling the Power Variability of Core Speed Scaling on Homogeneous Multicore Systems. Scientific Programming. 2017: 1-13
You Y, Demmel J, Czechowski K, et al. (2017) Design and Implementation of a Communication-Optimal Classifier for Distributed Kernel Support Vector Machines. IEEE Transactions on Parallel and Distributed Systems. 28: 974-988
Wu Z, Tucker TM, Nath C, et al. (2016) Step Ring-Based Three-Dimensional Path Planning Via Graphics Processing Unit Simulation for Subtractive Three-Dimensional Printing. Journal of Manufacturing Science and Engineering. 139
Hossain MM, Tucker TM, Kurfess TR, et al. (2016) Hybrid Dynamic Trees for Extreme-Resolution 3D Sparse Data Modeling. Proceedings - 2016 IEEE 30th International Parallel and Distributed Processing Symposium, IPDPS 2016. 132-141
Park S, Vuduc R, Harrold MJ. (2015) UNICORN: A unified approach for localizing non-deadlock concurrency bugs. Software Testing, Verification and Reliability. 25: 167-190
Choi J, Chandramowlishwaran A, Madduri K, et al. (2014) A CPU-GPU hybrid implementation and model-driven scheduling of the fast multipole method. ACM International Conference Proceeding Series. 64-71