Year |
Citation |
Score |
2016 |
Maleki S, Yang A, Burtscher M. Higher-order and tuple-based massively-parallel prefix sums Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI). 13: 539-552. DOI: 10.1145/2908080.2908089 |
0.334 |
|
2015 |
Yang A, Mukka H, Hesaaraki F, Burtscher M. MPC: A massively parallel compression algorithm for scientific data Proceedings - IEEE International Conference on Cluster Computing, ICCC. 2015: 381-389. DOI: 10.1109/CLUSTER.2015.59 |
0.342 |
|
2014 |
Uzelac V, Milenkovic A, Milenkovic M, Burtscher M. Using branch predictors and variable encoding for on-the-fly program tracing IEEE Transactions on Computers. 63: 1008-1020. DOI: 10.1109/Tc.2012.267 |
0.381 |
|
2014 |
O'Neil MA, Burtscher M. Microarchitectural performance characterization of irregular GPU kernels IISWC 2014 - IEEE International Symposium on Workload Characterization. 130-139. DOI: 10.1109/IISWC.2014.6983052 |
0.333 |
|
2012 |
Ratanaworabhan P, Burtscher M, Kirovski D, Zorn B, Nagpal R, Pattabiraman K. Efficient Runtime Detection and Toleration of Asymmetric Races IEEE Transactions on Computers. 61: 548-562. DOI: 10.1109/Tc.2011.48 |
0.733 |
|
2011 |
O'Neil MA, Burtscher M. Floating-point data compression at 75 Gb/s on a GPU ACM International Conference Proceeding Series. DOI: 10.1145/1964179.1964189 |
0.341 |
|
2011 |
Milenković A, Uzelac V, Milenković M, Burtscher M. Caches and predictors for real-time, unobtrusive, and cost-effective program tracing in embedded systems IEEE Transactions on Computers. 60: 992-1005. DOI: 10.1109/Tc.2010.146 |
0.406 |
|
2010 |
Burtscher M, Ratanaworabhan P. gFPC: A self-tuning compression algorithm Data Compression Conference Proceedings. 396-405. DOI: 10.1109/DCC.2010.42 |
0.759 |
|
2009 |
Burtscher M, Ratanaworabhan P. FPC: A high-speed compressor for double-precision floating-point data IEEE Transactions on Computers. 58: 18-31. DOI: 10.1109/Tc.2008.131 |
0.788 |
|
2009 |
Burtscher M, Ratanaworabhan P. pFPC: A parallel compressor for floating-point data Data Compression Conference Proceedings. 43-52. DOI: 10.1109/DCC.2009.43 |
0.771 |
|
2009 |
Burtscher M, Ratanaworabhan P. pFPC: A parallel compressor for floating-point data Data Compression Conference Proceedings. 43-52. |
0.404 |
|
2008 |
Ratanaworabhan P, Burtscher M. Program phase detection based on critical basic block transitions ISPASS 2008 - IEEE International Symposium on Performance Analysis of Systems and Software. 11-21. DOI: 10.1109/ISPASS.2008.4510734 |
0.725 |
|
2007 |
Burtscher M, Ratanaworabhan P. High throughput compression of double-precision floating-point data Data Compression Conference Proceedings. 293-302. DOI: 10.1109/DCC.2007.44 |
0.775 |
|
2007 |
Milenkovic M, Milenkovic A, Burtscher M. Algorithms and hardware structures for unobtrusive real-time compression of instruction and data address traces Data Compression Conference Proceedings. 283-292. DOI: 10.1109/DCC.2007.10 |
0.429 |
|
2006 |
Ganusov I, Burtscher M. Future execution: A prefetching mechanism that uses multiple cores to speed up single threads ACM Transactions on Architecture and Code Optimization. 3: 424-449. DOI: 10.1145/1187976.1187979 |
0.716 |
|
2006 |
Ganusov I, Burtscher M. Efficient emulation of hardware prefetchers via event-driven helper threading Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT. 2006: 144-153. DOI: 10.1145/1152154.1152178 |
0.312 |
|
2006 |
Ratanaworabhan P, Burtscher M. Load instruction characterization and acceleration of the BioPerf programs Proceedings of the 2006 IEEE International Symposium on Workload Characterization, IISWC - 2006. 71-79. DOI: 10.1109/IISWC.2006.302731 |
0.742 |
|
2006 |
Ratanaworabhan P, Ke J, Burtscher M. Fast lossless compression of scientific floating-point data Data Compression Conference Proceedings. 133-142. DOI: 10.1109/DCC.2006.35 |
0.774 |
|
2005 |
Sam NB, Burtscher M. Improving memory system performance with energy-efficient value speculation ACM SIGARCH Computer Architecture News. 33: 121-127. DOI: 10.1145/1105734.1105751 |
0.44 |
|
2005 |
Burtscher M, Ganusov I, Jackson SJ, Ke J, Ratanaworabhan P, Sam NB. The VPC trace-compression algorithms IEEE Transactions on Computers. 54: 1329-1344. DOI: 10.1109/Tc.2005.186 |
0.68 |
|
2005 |
Ganusov I, Burtscher M. Future execution: A hardware prefetching technique for chip multiprocessors Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT. 2005: 350-360. DOI: 10.1109/PACT.2005.23 |
0.321 |
|
2005 |
Liu CC, Ganusov I, Burtscher M, Tiwari S. Bridging the processor-memory performance gap with 3D IC technology IEEE Design and Test of Computers. 22: 556-564. DOI: 10.1109/Mdt.2005.134 |
0.684 |
|
2005 |
Burtscher M, Sam NB. Automatic generation of high-performance trace compressors Proceedings of the 2005 International Symposium on Code Generation and Optimization, CGO 2005. 2005: 229-240. DOI: 10.1109/CGO.2005.6 |
0.37 |
|
2004 |
Ke J, Burtscher M, Speight E. Runtime compression of MPI messages to improve the performance and scalability of parallel applications Proceedings of the ACM/IEEE SC 2004 Conference: Bridging Communities. DOI: 10.1109/SC.2004.52 |
0.312 |
|
2004 |
Burtscher M. VPC3: A fast and effective trace-compression algorithm Performance Evaluation Review. 32: 167-176. |
0.375 |
|
2004 |
Ke J, Burtscher M, Speight E. Runtime compression of MPI messages to improve the performance and scalability of parallel applications IEEE/ACM SC2004 Conference, Proceedings. 289-295. |
0.312 |
|
2003 |
Burtscher M, Jeeradit M. Compressing extended program traces using value predictors Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT. 2003: 159-169. DOI: 10.1109/PACT.2003.1238012 |
0.44 |
|
2002 |
Burtscher M. An improved index function for (D)FCM predictors ACM SIGARCH Computer Architecture News. 30: 19-24. DOI: 10.1145/571666.571677 |
0.315 |
|
2002 |
Burtscher M, Zorn BG. Hybrid load-value predictors IEEE Transactions on Computers. 51: 759-774. DOI: 10.1109/Tc.2002.1017696 |
0.425 |
|
2002 |
Burtscher M, Diwan A, Hauswirth M. Static load classification for improving the value predictability of data-cache misses Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI). 222-233. |
0.34 |
|