Wen-mei W. Hwu - Publications

Affiliations: 
University of Illinois, Urbana-Champaign, Urbana-Champaign, IL 
Area:
Electronics and Electrical Engineering, Computer Science

104 high-probability publications. We are testing a new system for linking publications to authors. You can help! If you notice any inaccuracies, please sign in and mark papers as correct or incorrect matches. If you identify any major omissions or other inaccuracies in the publication list, please let us know.

Year Citation  Score
2018 Hwu W, Patel S. Accelerator Architectures A Ten-Year Retrospective Ieee Micro. 38: 56-62. DOI: 10.1109/Mm.2018.2877839  0.92
2016 Heo Y, Ramachandran A, Hwu WM, Ma J, Chen D. BLESS 2: Accurate, memory-efficient, and fast error correction method. Bioinformatics (Oxford, England). PMID 27153708 DOI: 10.1093/Bioinformatics/Btw146  0.56
2016 Banerjee SS, Athreya AP, Mainzer LS, Jongeneel CV, Hwu WM, Kalbarczyk ZT, Iyer RK. Efficient and scalable workflows for genomic analyses Didc 2016 - Proceedings of the Acm International Workshop On Data-Intensive Distributed Computing. 27-36. DOI: 10.1145/2912152.2912156  0.96
2016 Chang LW, Kim HS, Hwu WM. DySel: Lightweight dynamic selection for kernel-based data-parallel programming model International Conference On Architectural Support For Programming Languages and Operating Systems - Asplos. 2: 667-680. DOI: 10.1145/2872362.2872373  0.96
2016 El Hajj I, Merritt A, Zellweger G, Milojicic D, Achermann R, Faraboschi P, Hwu WM, Roscoe T, Schwan K. SpaceJMP: Programming with multiple virtual address spaces International Conference On Architectural Support For Programming Languages and Operating Systems - Asplos. 2: 353-368. DOI: 10.1145/2872362.2872366  0.96
2016 Chang LW, El Hajj I, Kim HS, Gómez-Luna J, Dakkak A, Hwu WM. A programming system for future proofing performance critical libraries Proceedings of the Acm Sigplan Symposium On Principles and Practice of Parallel Programming, Ppopp. 12. DOI: 10.1145/2851141.2851178  0.96
2016 Haydel N, Gesing S, Taylor I, Madey G, Dakkak A, De Gonzalo SG, Hwu WMW. Enhancing the Usability and Utilization of Accelerated Architectures via Docker Proceedings - 2015 Ieee/Acm 8th International Conference On Utility and Cloud Computing, Ucc 2015. 361-367. DOI: 10.1109/UCC.2015.57  0.96
2016 Gomez-Luna J, Sung IJ, Chang LW, Gonzalez-Linares JM, Guil N, Hwu WMW. In-Place Matrix Transposition on GPUs Ieee Transactions On Parallel and Distributed Systems. 27: 776-788. DOI: 10.1109/Tpds.2015.2412549  0.96
2016 Hwu WM. CUDA application development 2008 Ieee Hot Chips 20 Symposium, Hcs 2008. DOI: 10.1109/HOTCHIPS.2008.7476522  0.56
2015 Cabezas J, Gelado I, Stone JE, Navarro N, Kirk DB, Hwu WM. Runtime and Architecture Support for Efficient Data Exchange in Multi-Accelerator Applications. Ieee Transactions On Parallel and Distributed Systems : a Publication of the Ieee Computer Society. 26: 1405-1418. PMID 26180487 DOI: 10.1109/Tpds.2014.2316825  0.56
2015 Takizawa H, Hirasawa S, Sugawara M, Gelado I, Kobayashi H, Hwu WMW. Optimized Data Transfers Based on the OpenCL Event Management Mechanism Scientific Programming. 2015. DOI: 10.1155/2015/576498  0.96
2015 Cabezas J, Vilanova L, Gelado I, Jablin TB, Navarro N, Hwu WMW. Automatic parallelization of kernels in shared-memory multi-GPU nodes Proceedings of the International Conference On Supercomputing. 2015: 3-13. DOI: 10.1145/2751205.2751218  0.96
2015 Cabezas J, Jordà M, Gelado I, Navarro N, Hwu WM. GPU-SM: Shared memory multi-GPU programming Acm International Conference Proceeding Series. 2015: 13-24. DOI: 10.1145/2716282.2716286  0.96
2015 Cabezas J, Gelado I, Stone JE, Navarro N, Kirk DB, Hwu WM. Runtime and Architecture Support for Efficient Data Exchange in Multi-Accelerator Applications Ieee Transactions On Parallel and Distributed Systems. 26: 1405-1418. DOI: 10.1109/TPDS.2014.2316825  0.96
2015 Chen X, Chang LW, Rodrigues CI, Lv J, Wang Z, Hwu WM. Adaptive Cache Management for Energy-Efficient GPU Computing Proceedings of the Annual International Symposium On Microarchitecture, Micro. 2015: 343-355. DOI: 10.1109/MICRO.2014.11  0.96
2015 Luna JG, Chang LW, Sung IJ, Hwu WM, Guil N. In-place data sliding algorithms for many-core architectures Proceedings of the International Conference On Parallel Processing. 2015: 210-219. DOI: 10.1109/ICPP.2015.30  0.96
2015 Kim HS, Hajj IE, Stratton J, Lumetta S, Hwu WM. Locality-centric thread scheduling for bulk-synchronous programming models on CPU architectures Proceedings of the 2015 Ieee/Acm International Symposium On Code Generation and Optimization, Cgo 2015. 257-268. DOI: 10.1109/CGO.2015.7054205  0.4
2015 Hwu WM, Chang LW, Kim HS, Dakkak A, El Hajj I. Transitioning HPC software to exascale heterogeneous computing 2015 Computational Electromagnetics International Workshop, Cem 2015. 4-5. DOI: 10.1109/CEM.2015.7237412  0.4
2015 Sung IJ, Chung WH, Lee YW, Hwu WM. Mapping high-level programming languages to OpenCL 2.0: A compiler writer's perspective Heterogeneous Computing With Opencl 2.0: Third Edition. 249-272. DOI: 10.1016/B978-0-12-801414-1.00011-9  0.96
2015 Juckeland G, Brantley W, Chandrasekaran S, Chapman B, Che S, Colgrove M, Feng H, Grund A, Henschel R, Hwu WMW, Li H, Müller MS, Nagel WE, Perminov M, Shelepugin P, et al. SPEC ACCEL: A standard application suite for measuring hardware accelerator performance Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 8966: 46-67. DOI: 10.1007/978-3-319-17248-4_3  0.96
2014 Heo Y, Wu XL, Chen D, Ma J, Hwu WM. BLESS: bloom filter-based error correction solution for high-throughput sequencing reads. Bioinformatics (Oxford, England). 30: 1354-62. PMID 24451628 DOI: 10.1093/Bioinformatics/Btu030  0.96
2014 Cabezas J, Vilanova L, Gelado I, Jablin TB, Navarro N, Hwu WM. Automatic execution of single-GPU computations across multiple GPUs Parallel Architectures and Compilation Techniques - Conference Proceedings, Pact. 467-468. DOI: 10.1145/2628071.2628109  0.96
2014 Chen X, Wu S, Chang LW, Huang WS, Pearson C, Wang Z, Hwu WMW. Adaptive cache bypass and insertion for many-core accelerators Acm International Conference Proceeding Series. 1-8. DOI: 10.1145/2613908.2613909  0.96
2014 Rodrigues C, Jablin T, Dakkak A, Hwu WM. Triolet: A programming system that unifies algorithmic skeleton interfaces for high-performance cluster computing Acm Sigplan Notices. 49: 247-258. DOI: 10.1145/2555243.2555268  0.96
2014 Sung IJ, Gómez-Luna J, González-Linares JM, Guil N, Hwu WMW. In-place transposition of rectangular matrices on accelerators Acm Sigplan Notices. 49: 207-218. DOI: 10.1145/2555243.2555266  0.96
2014 Hwu WM. What is ahead for parallel computing Journal of Parallel and Distributed Computing. 74: 2574-2581. DOI: 10.1016/J.Jpdc.2014.02.005  0.96
2014 Chang LW, Hwu WMW. A guide for implementing tridiagonal solvers on GPUs Numerical Computations With Gpus. 29-44. DOI: 10.1007/978-3-319-06548-9_2  0.96
2013 Ahmad A, Shemonski ND, Adie SG, Kim HS, Hwu WM, Carney PS, Boppart SA. Real-time in vivo computed optical interferometric tomography. Nature Photonics. 7: 444-448. PMID 23956790 DOI: 10.1038/Nphoton.2013.71  0.96
2013 Gai J, Obeid N, Holtrop JL, Wu XL, Lam F, Fu M, Haldar JP, Hwu WM, Liang ZP, Sutton BP. More IMPATIENT: A Gridding-Accelerated Toeplitz-based Strategy for Non-Cartesian High-Resolution 3D MRI on GPUs. Journal of Parallel and Distributed Computing. 73: 686-697. PMID 23682203 DOI: 10.1016/J.Jpdc.2013.01.001  0.96
2013 Papakonstantinou A, Gururaj K, Stratton JA, Chen D, Cong J, Hwu WMW. Efficient compilation of CUDA kernels for high-performance computing on FPGAs Transactions On Embedded Computing Systems. 13. DOI: 10.1145/2514641.2514652  0.96
2013 Papakonstantinou A, Chen D, Hwu WM, Cong J, Yun L. Throughput-oriented kernel porting onto FPGAs Proceedings - Design Automation Conference. DOI: 10.1145/2463209.2488747  0.96
2013 Tanasic I, Vilanova L, Jordà M, Cabezas J, Gelado I, Navarro N, Hwu WM. Comparison based sorting for systems with multiple GPUs Acm International Conference Proceeding Series. 1-11. DOI: 10.1145/2458523.2458524  0.96
2013 Huang X, Rodrigues CI, Jones S, Buck I, Hwu WM. Scalable SIMD-parallel memory allocation for many-core machines Journal of Supercomputing. 64: 1008-1020. DOI: 10.1007/S11227-011-0680-7  0.96
2013 Atkinson IC, Liu G, Obeid N, Thulborn KR, Hwu WM. Rapid computation of sodium bioscales using gpu-accelerated image reconstruction International Journal of Imaging Systems and Technology. 23: 29-35. DOI: 10.1002/Ima.22033  0.96
2012 Wu XL, Heo Y, El Hajj I, Hwu WM, Chen D, Ma J. TIGER: tiled iterative genome assembler. Bmc Bioinformatics. 13: S18. PMID 23281792 DOI: 10.1186/1471-2105-13-S19-S18  0.96
2012 Kim H, Vuduc R, Baghsorkhi S, Hwu WM, Jee Choi. Performance analysis and tuning for general purpose graphics processing units (GPGPU) Synthesis Lectures On Computer Architecture. 20: 1-94. DOI: 10.2200/S00451ED1V01Y201209CAC020  0.96
2012 Baghsorkhi SS, Gelado I, Delahaye M, Hwu WMW. Efficient performance evaluation of memory hierarchy for highly multithreaded graphics processors Proceedings of the Acm Sigplan Symposium On Principles and Practice of Parallel Programming, Ppopp. 23-33. DOI: 10.1145/2145816.2145820  0.96
2012 Chang LW, Stratton JA, Kim HS, Hwu WMW. A scalable, numerically stable, high-performance tridiagonal solver using GPUs International Conference For High Performance Computing, Networking, Storage and Analysis, Sc. DOI: 10.1109/SC.2012.12  0.96
2012 Stratton JA, Rodrigues C, Sung IJR, Chang LW, Anssari N, Liu GD, Hwu WMW, Obeid N. Algorithm and data optimization techniques for scaling to massively threaded systems Computer. 45: 26-32. DOI: 10.1109/Mc.2012.194  0.96
2012 Sung IJ, Liu GD, Hwu WMW. DL: A data layout transformation system for heterogeneous computing 2012 Innovative Parallel Computing, Inpar 2012. DOI: 10.1109/InPar.2012.6339606  0.96
2012 Kim HS, Ahn M, Stratton JA, Hwu WMW. Design evaluation of OpenCL compiler framework for coarse-grained reconfigurable arrays Fpt 2012 - 2012 International Conference On Field-Programmable Technology. 313-320. DOI: 10.1109/FPT.2012.6412155  0.96
2012 Kofsky SM, Johnson DR, Stratton JA, Hwu WMW, Patel SJ, Lumetta SS. Implementing a GPU programming model on a non-GPU accelerator architecture Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 6161: 40-51. DOI: 10.1007/978-3-642-24322-6_5  0.96
2012 Adie SG, Ahmad A, Shemonski N, Graf BW, Kim H, Hwu WMW, Carney PS, Boppart SA. Interferometric synthetic aperture microscopy with computational adaptive optics for high-resolution tomography of scattering tissue Biomedical Optics, Biomed 2012. BW2A.1.  0.96
2011 Wu XL, Zhuo Y, Gai J, Lam F, Fu M, Haldar JP, Hwu WM, Liang ZP, Sutton BP. Advanced MRI reconstruction toolbox with accelerating on GPU Proceedings of Spie - the International Society For Optical Engineering. 7872. DOI: 10.1117/12.872204  0.96
2011 Showerman M, Enos J, Steffen C, Treichler S, Gropp W, Hwu WMW. EcoG: A power-efficient GPU cluster architecture for scientific computing Computing in Science and Engineering. 13: 83-87. DOI: 10.1109/MCSE.2011.30  0.96
2011 Wu XL, Gai J, Lam F, Fu M, Haldar JP, Zhuo Y, Liang ZP, Hwu WM, Sutton BP. Impatient MRI: Illinois Massively Parallel Acceleration Toolkit for image reconstruction with enhanced throughput in MRI Proceedings - International Symposium On Biomedical Imaging. 69-72. DOI: 10.1109/ISBI.2011.5872356  0.96
2011 Papakonstantinou A, Liang Y, Stratton JA, Gururaj K, Chen D, Hwu WMW, Cong J. Multilevel granularity parallelism synthesis on FPGAs Proceedings - Ieee International Symposium On Field-Programmable Custom Computing Machines, Fccm 2011. 178-185. DOI: 10.1109/FCCM.2011.29  0.96
2011 Zhuo Y, Wu XL, Haldar JP, Marin T, Hwu WmW, Liang ZP, Sutton BP. Using GPUs to accelerate advanced MRI reconstruction with field inhomogeneity compensation Gpu Computing Gems Emerald Edition. 709-722. DOI: 10.1016/B978-0-12-384988-5.00044-9  0.96
2010 Hwu W. Session details: Emerging technologies and interconnect Acm Sigarch Computer Architecture News. 38. DOI: 10.1145/3264044  0.92
2010 Stratton JA, Grover V, Marathe J, Aarts B, Murphy M, Hu Z, Hwu WMW. Efficient compilation of fine-grained SPMD-threaded programs for multicore CPUs Proceedings of the 2010 Cgo - the 8th International Symposium On Code Generation and Optimization. 111-119. DOI: 10.1145/1772954.1772971  0.96
2010 Gelado I, Cabezas J, Navarro N, Stone JE, Patel S, Hwu WMW. An asymmetric distributed shared memory model for heterogeneous parallel systems International Conference On Architectural Support For Programming Languages and Operating Systems - Asplos. 347-358. DOI: 10.1145/1736020.1736059  0.96
2010 Baghsorkhi SS, Delahaye M, Patel SJ, Gropp WD, Hwu WMW. An adaptive performance modeling tool for GPU architectures Proceedings of the Acm Sigplan Symposium On Principles and Practice of Parallel Programming, Ppopp. 105-114. DOI: 10.1145/1693453.1693470  0.96
2010 Zhuo Y, Wu XL, Haldar JP, Hwu WM, Liang ZP, Sutton BP. Accelerating iterative field-compensated MR image reconstruction on GPUs 2010 7th Ieee International Symposium On Biomedical Imaging: From Nano to Macro, Isbi 2010 - Proceedings. 820-823. DOI: 10.1109/ISBI.2010.5490112  0.96
2010 Wu XL, Obeid N, Hwu WM. Exploiting more parallelism from applications having generalized reductions on GPU architectures Proceedings - 10th Ieee International Conference On Computer and Information Technology, Cit-2010, 7th Ieee International Conference On Embedded Software and Systems, Icess-2010, Scalcom-2010. 1175-1180. DOI: 10.1109/CIT.2010.213  0.96
2010 Zhuo Y, Sutton B, Wu XL, Haldar J, Hwu WM, Liang ZP. Sparse regularization in MRI iterative reconstruction using GPUs Proceedings - 2010 3rd International Conference On Biomedical Engineering and Informatics, Bmei 2010. 2: 578-582. DOI: 10.1109/BMEI.2010.5640008  0.96
2010 Sung IJ, Stratton JA, Hwu WMW. Data layout transformation exploiting memory-level parallelism in structured grid many-core applications Parallel Architectures and Compilation Techniques - Conference Proceedings, Pact. 513-522. DOI: 10.1007/S10766-011-0182-5  0.96
2010 Shinn AF, Vanka SP, Hwu WW. Direct numerical simulation of turbulent flow in a square duct using a Graphics Processing Unit (GPU) 40th Aiaa Fluid Dynamics Conference 0.96
2009 Papakonstantinou A, Gururaj K, Stratton JA, Chen D, Cong J, Hwu WMW. High-performance CUDA kernel execution on FPGAs Proceedings of the International Conference On Supercomputing. 515-516. DOI: 10.1145/1542275.1542357  0.96
2009 Stone JE, Saam J, Hardy DJ, Vandivort KL, Hwu WMW, Schulten K. High performance computation and display of molecular orbitals on and multi-core cpus Proceedings of 2nd Workshop On General Purpose Processing On Graphics Processing Units, Gpgpu-2. 9. DOI: 10.1145/1513895.1513897  0.96
2009 Papakonstantinou A, Gururaj K, Stratton JA, Chen D, Cong J, Hwu WMW. FCUDA: Enabling efficient compilation of CUDA kernels onto FPGAs 2009 Ieee 7th Symposium On Application Specific Processors, Sasp 2009. 35-42. DOI: 10.1109/SASP.2009.5226333  0.96
2009 Hwu WMW, Nandakumar D, Haldar J, Atkinson IC, Sutton B, Liang ZP, Thulborn KR. Accelerating mr image reconstruction on GPUs Proceedings - 2009 Ieee International Symposium On Biomedical Imaging: From Nano to Macro, Isbi 2009. 1283-1286. DOI: 10.1109/ISBI.2009.5193297  0.96
2009 Roberts E, Stone JE, Sepúlveda L, Hwu WMW, Luthey-Schulten Z. Long time-scale simulations of in vivo diffusion using GPU hardware Ipdps 2009 - Proceedings of the 2009 Ieee International Parallel and Distributed Processing Symposium. DOI: 10.1109/IPDPS.2009.5160930  0.96
2009 Kindratenko VV, Enos JJ, Shi G, Showerman MT, Arnold GW, Stone JE, Phillips JC, Hwu WM. GPU clusters for high-performance computing Proceedings - Ieee International Conference On Cluster Computing, Iccc. DOI: 10.1109/CLUSTR.2009.5289128  0.96
2009 Hunter HC, Nystrom EM, Connors DA, Hwu WmW. Hardware-compiler co-design for adjustable data power savings Microprocessors and Microsystems. 33: 244-253. DOI: 10.1016/J.Micpro.2009.02.003  0.96
2008 Stone SS, Haldar JP, Tsao SC, Hwu WM, Sutton BP, Liang ZP. Accelerating Advanced MRI Reconstructions on GPUs. Journal of Parallel and Distributed Computing. 68: 1307-1318. PMID 21796230 DOI: 10.1016/J.Jpdc.2008.05.013  0.96
2008 Gelado I, Kelm JH, Ryoo S, Lumetta SS, Navarro N, Hwu WMW. CUBA: An architecture for efficient CPU/Co-processor data communication Proceedings of the International Conference On Supercomputing. 299-308. DOI: 10.1145/1375527.1375571  0.96
2008 Rodrigues CI, Hardy DJ, Stone JE, Schulten K, Hwu WMW. GPU acceleration of cutoff pair potentials for molecular modeling applications Conference On Computing Frontiers - Proceedings of the 2008 Conference On Computing Frontiers, Cf'08. 273-282. DOI: 10.1145/1366230.1366277  0.96
2008 Stone SS, Haldar JP, Tsao SC, Hwu WMW, Liang ZP, Sutton BP. Accelerating advanced MRI reconstructions on GPUs Conference On Computing Frontiers - Proceedings of the 2008 Conference On Computing Frontiers, Cf'08. 261-272. DOI: 10.1145/1366230.1366276  0.96
2008 Ryoo S, Rodrigues CI, Stone SS, Baghsorkhi SS, Ueng SZ, Stratton JA, Hwu WMW. Program optimization space pruning for a multithreaded GPU Proceedings of the 2008 Cgo - Sixth International Symposium On Code Generation and Optimization. 195-204. DOI: 10.1145/1356058.1356084  0.96
2008 Wah E, Johnson E, Auvil L, Thakkar U, Hwu WM, Kirk D, Dunning TH, Glotzer SC. Visualization and analysis of GPU summer school applicants and participants Proceedings - 4th Ieee International Conference On Escience, Escience 2008. 362-363. DOI: 10.1109/eScience.2008.134  0.96
2008 Ryoo S, Rodrigues CI, Stone SS, Stratton JA, Ueng SZ, Baghsorkhi SS, Hwu WmW. Program optimization carving for GPU computing Journal of Parallel and Distributed Computing. 68: 1389-1401. DOI: 10.1016/J.Jpdc.2008.05.011  0.96
2008 Stratton JA, Stone SS, Hwu WMW. MCUDA: An efficient implementation of CUDA kernels for multi-core CPUs Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 5335: 16-30. DOI: 10.1007/978-3-540-89740-8_2  0.96
2008 Ueng SZ, Lathara M, Baghsorkhi SS, Hwu WMW. CUDA-Lite: Reducing GPU programming complexity Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 5335: 1-15. DOI: 10.1007/978-3-540-89740-8_1  0.96
2007 Iyer RK, Kalbarczyk Z, Pattabiraman K, Healey W, Hwu WW, Klemperer P, Farivar R. Toward application-aware security and reliability Ieee Security and Privacy. 5: 57-62. DOI: 10.1109/MSP.2007.23  0.96
2007 Hwu WM, Ryoo S, Ueng SZ, Keim JH, Gelado I, Stone SS, Kidd RE, Baghsorkhi SS, Mahesri AA, Tsao SC, Navarro N, Lumetta SS, Frank MI, Patel SJ. Implicitly parallel programming models for thousand-core microprocessors Proceedings - Design Automation Conference. 754-759. DOI: 10.1109/DAC.2007.375265  0.96
2007 Sarno L, Hwu WMW, Lund C, Levy M, Larus JR, Reinders J, Cameron G, Lennard C, Corporation T. Corezilla: Build and tame the multicore beast? Proceedings - Design Automation Conference. 632-633. DOI: 10.1109/DAC.2007.375240  0.96
2007 Ryoo S, Ueng SZ, Rodrigues CI, Kidd RE, Frank MI, Hwu WMW. Automatic discovery of coarse-grained parallelism in media applications Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 4050: 194-213. DOI: 10.1007/978-3-540-71528-3_13  0.96
2006 Barnes RD, Sias JW, Nystrom EM, Patel SJ, Navarro J, Hwu WMW. Beating in-order stalls with "Flea-Flicker" two-pass pipelining Ieee Transactions On Computers. 55: 18-33. DOI: 10.1109/Tc.2006.4  0.96
2006 Barnes RD, Ryoo S, Hwu WW. Tolerating Cache-Miss Latency with Multipass Pipelines Ieee Micro. 26: 40-47. DOI: 10.1109/Mm.2006.25  0.96
2003 Monks J, Ebert JP, Hwu WMW, Wolisz A. Energy saving and capacity improvement potential of power control in multi-hop wireless networks Computer Networks. 41: 313-330. DOI: 10.1016/S1389-1286(02)00416-4  0.96
2002 Hunter HC, Hwu WMW. Code coverage and input variability: Effects on architecture and compiler research Proceedings of the 2002 International Conference On Compilers, Architecture, and Synthesis For Embedded Systems, Cases '02. 79-87. DOI: 10.1145/581630.581643  0.96
2002 Barnes RD, Nystrom EM, Merten MC, Hwu WMW. Vacuum packing: Extracting hardware-detected program phases for post-link optimization Proceedings of the Annual International Symposium On Microarchitecture, Micro. 2002: 233-244. DOI: 10.1109/MICRO.2002.1176253  0.96
2001 Merten MC, Trick AR, Barnes RD, Nystrom EM, George CN, Gyllenhaal JC, Hwu WMW. An architectural framework for runtime optimization Ieee Transactions On Computers. 50: 567-589. DOI: 10.1109/12.931894  0.96
2001 Monks JP, Bharghavan V, Hwu WW. A power controlled multiple access protocol for wireless packet networks Proceedings - Ieee Infocom. 1: 219-228.  0.96
2000 Cheng BC, Hwu WW. Modular interprocedural pointer analysis using access paths: Design, implementation, and evaluation Proceedings of the Acm Sigplan Conference On Programming Language Design and Implementation (Pldi). 57-69.  0.96
1999 Johnson TL, Connors DA, Merten MC, Hwu WMW. Run-time cache bypassing Ieee Transactions On Computers. 48: 1338-1354. DOI: 10.1109/12.817393  0.96
1999 August DI, Hwu WMW, Mahlke SA. Partial reverse if-conversion framework for balancing control flow and predication International Journal of Parallel Programming. 27: 381-423. DOI: 10.1023/A:1018787007582  0.96
1998 Conte TM, Hirsch MA, Hwu WMW. Combining trace sampling with single pass methods for efficient cache simulation Ieee Transactions On Computers. 47: 714-719. DOI: 10.1109/12.689650  0.96
1997 Hsieh CHA, Conte MT, Johnson TL, Gyllenhaal JC, Hwu WMW. Optimizing NET compilers for improved java performance Computer. 30: 67-75. DOI: 10.1109/2.587551  0.96
1995 Hwu WMW, Hank RE, Lavery DM, Haab GE, Gyllenhaal JC, August DI, Gallagher DM, Mahlke SA. Compiler Technology for Future Microprocessors Proceedings of the Ieee. 83: 1625-1640. DOI: 10.1109/5.476079  0.96
1995 Chang PP, Warter NJ, Mahlke SA, Chen WY, Hwu WMW. Three Architectural Models for Compiler-Controlled Speculative Execution Ieee Transactions On Computers. 44: 481-494. DOI: 10.1109/12.376164  0.96
1995 Chang PP, Lavery DM, Mahlke SA, Chen WY, Hwu WmW. The Importance of Prepass Code Scheduling for Superscalar and Superpipelined Processors Ieee Transactions On Computers. 44: 353-370. DOI: 10.1109/12.372029  0.96
1995 Conte TM, Hwu WMW. Advances in Benchmarking Techniques: New Standards and Quantitative Metrics Advances in Computers. 41: 231-253. DOI: 10.1016/S0065-2458(08)60235-1  0.96
1994 Chen WY, Mahlke SA, Warter NJ, Anik S, Hwu WMW. Profile-assisted instruction scheduling International Journal of Parallel Programming. 22: 151-181. DOI: 10.1007/Bf02577873  0.96
1994 Hwu WM, Nicolau A. From the guest editors International Journal of Parallel Programming. 22: 207-208. DOI: 10.1007/BF02577732  0.96
1993 Mahlke SA, Chen WY, Bringmann RA, Hank RE, Hwu WMW, Rau BR, Schlansker MS. Sentinel Scheduling: A Model for Compiler-Controlled Speculative Execution Acm Transactions On Computer Systems (Tocs). 11: 376-408. DOI: 10.1145/161541.159765  0.96
1993 Hwu WMW, Mahlke SA, Chen WY, Chang PP, Warter NJ, Bringmann RA, Ouellette RG, Hank RE, Kiyohara T, Haab GE, Holm JG, Lavery DM. The superblock: An effective technique for VLIW and superscalar compilation The Journal of Supercomputing. 7: 229-248. DOI: 10.1007/Bf01205185  0.96
1992 Uvieghara GA, Hwu WmW, Nakagome Y, Jeong DK, Hodges DA, Patt YN, Lee DD. An Experimental Single-Chip Data Flow CPU Ieee Journal of Solid-State Circuits. 27: 17-28. DOI: 10.1109/4.109554  0.96
1992 Hwu WW, Chang PP. Efficient Instruction Sequencing with Inline Target Insertion Ieee Transactions On Computers. 41: 1537-1551. DOI: 10.1109/12.214662  0.96
1991 Conte TM, Hwu WMW. Benchmark Characterization Computer. 24: 48-56. DOI: 10.1109/2.67193  0.96
1989 Chang PP, Hwu WMW. Forward semantic: A compiler-assisted instruction fetch method for heavily pipelined processors Proceedings of the Annual International Symposium On Microarchitecture, Micro. 188-198. DOI: 10.1145/75362.75418  0.96
1989 Chang PP, Hwu WW. Inline Function Expansion for Compiling C Programs Acm Sigplan Notices. 24: 246-257. DOI: 10.1145/74818.74840  0.96
1989 Hwu W. Micro-21 from the program chair Acm Sigmicro Newsletter. 19: 24. DOI: 10.1145/378818.378848  0.92
1987 Hwu WMW, Patt YN. Checkpoint Repair for High-Performance Out-of-Order Execution Machines Ieee Transactions On Computers. 1496-1514. DOI: 10.1109/Tc.1987.5009500  0.96
Show low-probability matches.