Richard W. Vuduc, Ph.D.
Affiliations: | 1997-2004 | Computer Science Division | University of California, Berkeley, Berkeley, CA, United States |
Area:
High-performance computing, performance engineering, autotuningWebsite:
https://vuduc.orgGoogle:
"Richard Vuduc"Parents
Sign in to add mentorJames W. Demmel | grad student | 1997-2004 | UC Berkeley | |
(Automatic performance tuning of sparse matrix kernels.) | ||||
Katherine A. Yelick | grad student | 1997-2004 | UC Berkeley (Computer Science Tree) | |
(Automating performance tuning of sparse matrix kernels) |
Children
Sign in to add traineeAparna Chandramowlishwaran | grad student | 2008-2013 | Georgia Tech |
Cong Hou | grad student | 2009-2013 | Georgia Tech |
Sang Min Park | grad student | 2008-2014 | Georgia Tech |
Jee Whan Choi | grad student | 2008-2015 | Georgia Tech |
Mohammad M. Hossain | grad student | 2014-2016 | Georgia Tech |
Piyush Kumar Sao | grad student | 2011-2018 | Georgia Tech |
Jiajia Li | grad student | 2015-2018 | Georgia Tech |
Kenneth Czechowski | grad student | 2011-2019 | Georgia Tech |
Marat Dukhan | grad student | 2012-2021 | Georgia Tech |
BETA: Related publications
See more...
Publications
You can help our author matching system! If you notice any publications incorrectly attributed to this author, please sign in and mark matches as correct or incorrect. |
Li Z, Jia H, Zhang Y, et al. (2020) Automatic Generation of High-Performance FFT Kernels on Arm and X86 CPUs Ieee Transactions On Parallel and Distributed Systems. 31: 1925-1941 |
Sao P, Li XS, Vuduc R. (2019) A communication-avoiding 3D algorithm for sparse LU factorization on heterogeneous systems Journal of Parallel and Distributed Computing. 131: 218-234 |
Ma Y, Li J, Wu X, et al. (2019) Optimizing sparse tensor times matrix on GPUs Journal of Parallel and Distributed Computing. 129: 99-109 |
Hossain MM, Nath C, Tucker TM, et al. (2018) A Graphics Processor Unit-Accelerated Freeform Surface Offsetting Method for High-Resolution Subtractive Three-Dimensional Printing (Machining) Journal of Manufacturing Science and Engineering. 140 |
Du Z, Ge R, Lee VW, et al. (2017) Modeling the Power Variability of Core Speed Scaling on Homogeneous Multicore Systems Scientific Programming. 2017: 1-13 |
You Y, Demmel J, Czechowski K, et al. (2017) Design and Implementation of a Communication-Optimal Classifier for Distributed Kernel Support Vector Machines Ieee Transactions On Parallel and Distributed Systems. 28: 974-988 |
Wu Z, Tucker TM, Nath C, et al. (2016) Step Ring-Based Three-Dimensional Path Planning Via Graphics Processing Unit Simulation for Subtractive Three-Dimensional Printing Journal of Manufacturing Science and Engineering. 139 |
Hossain MM, Tucker TM, Kurfess TR, et al. (2016) Hybrid Dynamic Trees for Extreme-Resolution 3D Sparse Data Modeling Proceedings - 2016 Ieee 30th International Parallel and Distributed Processing Symposium, Ipdps 2016. 132-141 |
Park S, Vuduc R, Harrold MJ. (2015) UNICORN: A unified approach for localizing non-deadlock concurrency bugs Software Testing Verification and Reliability. 25: 167-190 |
Choi J, Chandramowlishwaran A, Madduri K, et al. (2014) A CPU-GPU hybrid implementation and model-driven scheduling of the fast multipole method Acm International Conference Proceeding Series. 64-71 |