John Mellor-Crummey - Publications

Affiliations: Rice University, Houston, TX
Area: Computer Science

94 high-probability publications.

Year Citation  Score
2016 Yang C, Mellor-Crummey J. A practical solution to the cactus stack problem Annual Acm Symposium On Parallelism in Algorithms and Architectures. 11: 61-70. DOI: 10.1145/2935764.2935787  1
2016 Yang C, Mellor-Crummey J. A wait-free queue as fast as fetch-and-add Proceedings of the Acm Sigplan Symposium On Principles and Practice of Parallel Programming, Ppopp. 12. DOI: 10.1145/2851141.2851168  1
2016 Chabbi M, Mellor-Crummey J. Contention-conscious, locality-preserving locks Proceedings of the Acm Sigplan Symposium On Principles and Practice of Parallel Programming, Ppopp. 12. DOI: 10.1145/2851141.2851166  1
2016 Aji AM, Panwar LS, Ji F, Murthy K, Chabbi M, Balaji P, Bisset KR, Dinan J, Feng WC, Mellor-Crummey J, Ma X, Thakur R. MPI-ACC: Accelerator-Aware MPI for Scientific Applications Ieee Transactions On Parallel and Distributed Systems. 27: 1401-1414. DOI: 10.1109/Tpds.2015.2446479  1
2016 Murthy K, Mellor-Crummey J. Communication Avoiding Algorithms: Analysis and Code Generation for Parallel Systems Parallel Architectures and Compilation Techniques - Conference Proceedings, Pact. 2016: 150-162. DOI: 10.1109/PACT.2015.41  1
2016 Paul SR, Araya-Polo M, Mellor-Crummey J, Hohl D. Performance analysis and optimization of a hybrid seismic imaging application Procedia Computer Science. 80: 8-18. DOI: 10.1016/j.procs.2016.05.293  1
2015 Chabbi M, Fagan M, Mellor-Crummey J. High performance locks for multi-level NUMA systems Proceedings of the Acm Sigplan Symposium On Principles and Practice of Parallel Programming, Ppopp. 2015: 215-226. DOI: 10.1145/2688500.2688503  0.48
2015 Chabbi M, Lavrijsen W, De Jong W, Sen K, Mellor-Crummey J, Iancu C. Barrier elision for production parallel programs Proceedings of the Acm Sigplan Symposium On Principles and Practice of Parallel Programming, Ppopp. 2015: 109-119. DOI: 10.1145/2688500.2688502  1
2015 Murthy K, Mellor-Crummey J. A Compiler Transformation to Overlap Communication with Dependent Computation Proceedings - 2015 9th International Conference On Partitioned Global Address Space Programming Models, Pgas 2015. 90-92. DOI: 10.1109/PGAS.2015.17  1
2014 Liu X, Sharma K, Mellor-Crummey J. ArrayTool: A lightweight profiler to guide array regrouping Parallel Architectures and Compilation Techniques - Conference Proceedings, Pact. 405-415. DOI: 10.1145/2628071.2628102  1
2014 Surendran R, Raman R, Chaudhuri S, Mellor-Crummey J, Sarkar V. Test-driven repair of data races in structured parallel programs Acm Sigplan Notices. 49: 15-25. DOI: 10.1145/2594291.2594335  1
2014 Hiranandani S, Kennedy K, Mellor-Crummey J, Sethi A. Compilation techniques for block-cyclic distributions Proceedings of the International Conference On Supercomputing. 205-216. DOI: 10.1145/2591635.2667169  1
2014 Mellor-Crummey J, Hiranandani S, Sethi A. Author retrospective: Compilation techniques for block-cyclic distributions Proceedings of the International Conference On Supercomputing. 29-31. DOI: 10.1145/2591635.2591651  1
2014 Liu X, Mellor-Crummey J. A tool to analyze the performance of multithreaded programs on NUMA architectures Proceedings of the Acm Sigplan Symposium On Principles and Practice of Parallel Programming, Ppopp. 259-271. DOI: 10.1145/2555243.2555271  1
2014 Yang C, Bland W, Mellor-Crummey J, Balaji P. Portable, MPI-interoperable Coarray Fortran Acm Sigplan Notices. 49: 81-92. DOI: 10.1145/2555243.2555270  1
2014 Chabbi M, Liu X, Mellor-Crummey J. Call paths for pin tools Proceedings of the 12th Acm/Ieee International Symposium On Code Generation and Optimization, Cgo 2014. 76-86. DOI: 10.1145/2544137.2544164  1
2014 Wei L, Mellor-Crummey J. Autotuning tensor transposition Proceedings of the International Parallel and Distributed Processing Symposium, Ipdps. 342-351. DOI: 10.1109/IPDPSW.2014.43  1
2013 Chabbi M, Murthy K, Fagan M, Mellor-Crummey J. Effective sampling-driven performance tools for GPU-accelerated supercomputers International Conference For High Performance Computing, Networking, Storage and Analysis, Sc. DOI: 10.1145/2503210.2503299  1
2013 Liu X, Mellor-Crummey J. A data-centric profiler for parallel programs International Conference For High Performance Computing, Networking, Storage and Analysis, Sc. DOI: 10.1145/2503210.2503297  1
2013 Liu X, Mellor-Crummey J, Fagan M. A new approach for performance analysis of OpenMP programs Proceedings of the International Conference On Supercomputing. 69-80. DOI: 10.1145/2464996.2465433  1
2013 Aji AM, Panwar LS, Ji F, Chabbi M, Murthy K, Balaji P, Bisset KR, Dinan J, Feng WC, Mellor-Crummey J, Ma X, Thakur R. On the efficacy of GPU-integrated MPI for scientific applications Hpdc 2013 - Proceedings of the 22nd Acm International Symposium On High-Performance Parallel and Distributed Computing. 191-202. DOI: 10.1145/2462902.2462915  1
2013 Liu X, Mellor-Crummey J. Pinpointing data locality bottlenecks with low overhead Ispass 2013 - Ieee International Symposium On Performance Analysis of Systems and Software. 183-193. DOI: 10.1109/ISPASS.2013.6557169  1
2013 Yang C, Murthy K, Mellor-Crummey J. Managing asynchronous operations in Coarray Fortran 2.0 Proceedings - Ieee 27th International Parallel and Distributed Processing Symposium, Ipdps 2013. 1321-1332. DOI: 10.1109/IPDPS.2013.17  1
2012 Chabbi M, Mellor-Crummey J. DeadSpy: A tool to pinpoint program inefficiencies Proceedings - International Symposium On Code Generation and Optimization, Cgo 2012. 124-134. DOI: 10.1145/2259016.2259033  1
2012 Tallent NR, Mellor-Crummey J. Using sampling to understand parallel program performance Proceedings of the 5th International Workshop On Parallel Tools For High Performance Computing 2011. 13-25. DOI: 10.1007/978-3-642-31476-6_2  1
2011 Chabbi MM, Mellor-Crummey JM, Cooper KD. Efficiently exploring compiler optimization sequences with pairwise pruning Acm International Conference Proceeding Series. 34-45. DOI: 10.1145/2000417.2000421  1
2011 Tallent NR, Mellor-Crummey J, Franco M, Landrum R, Adhianto L. Scalable fine-grained call path tracing Proceedings of the International Conference On Supercomputing. 63-74. DOI: 10.1145/1995896.1995908  1
2011 Jin G, Mellor-Crummey J, Adhianto L, Scherer WN, Yang C. Implementation and performance evaluation of the HPC challenge benchmarks in Coarray Fortran 2.0 Proceedings - 25th Ieee International Parallel and Distributed Processing Symposium, Ipdps 2011. 1089-1100. DOI: 10.1109/IPDPS.2011.104  1
2011 Liu X, Mellor-Crummey J. Pinpointing data locality problems using data-centric analysis Proceedings - International Symposium On Code Generation and Optimization, Cgo 2011. 171-180. DOI: 10.1109/CGO.2011.5764685  1
2010 Scherer WN, Adhianto L, Jin G, Mellor-Crummey J, Yang C. Hiding latency in Coarray Fortran 2.0 Acm International Conference Proceeding Series. DOI: 10.1145/2020373.2020387  1
2010 Tallent NR, Mellor-Crummey JM, Porterfield A. Analyzing lock contention in multithreaded applications Proceedings of the Acm Sigplan Symposium On Principles and Practice of Parallel Programming, Ppopp. 269-279. DOI: 10.1145/1693453.1693489  1
2010 Tallent NR, Adhianto L, Mellor-Crummey JM. Scalable identification of load imbalance in parallel executions using call path profiles 2010 Acm/Ieee International Conference For High Performance Computing, Networking, Storage and Analysis, Sc 2010. DOI: 10.1109/SC.2010.47  1
2010 Adhianto L, Mellor-Crummey J, Tallent NR. Effectively presenting call path profiles of application performance Proceedings of the International Conference On Parallel Processing Workshops. 179-188. DOI: 10.1109/ICPPW.2010.35  1
2010 Adhianto L, Banerjee S, Fagan M, Krentel M, Marin G, Mellor-Crummey J, Tallent NR. HPCTOOLKIT: Tools for performance analysis of optimized parallel programs Concurrency Computation Practice and Experience. 22: 685-701. DOI: 10.1002/cpe  1
2009 Mellor-Crummey J, Adhianto L, Scherer WN, Jin G. A new vision for Coarray Fortran Acm International Conference Proceeding Series. DOI: 10.1145/1809961.1809969  1
2009 Tallent NR, Mellor-Crummey JM, Adhianto L, Fagan MW, Krentel M. Diagnosing performance bottlenecks in emerging petascale applications Proceedings of the Conference On High Performance Computing Networking, Storage and Analysis, Sc '09. DOI: 10.1145/1654059.1654111  1
2009 Tallent NR, Mellor-Crummey JM, Fagan MW. Binary analysis for measurement and attribution of program performance Acm Sigplan Notices. 44: 441-452. DOI: 10.1145/1542476.1542526  1
2009 Tallent NR, Mellor-Crummey JM. Effective performance measurement and analysis of multithreaded applications Acm Sigplan Notices. 44: 229-239. DOI: 10.1145/1504176.1504210  1
2009 Tallent NR, Mellor-Crummey JM. Identifying performance bottlenecks in work-stealing computations Computer. 42: 44-50. DOI: 10.1109/Mc.2009.396  1
2009 Chen JH, Choudhary A, De Supinski B, Devries M, Hawkes ER, Klasky S, Liao WK, Ma KL, Mellor-Crummey J, Podhorszki N, Sankaran R, Shende S, Yoo CS. Terascale direct numerical simulations of turbulent combustion using S3D Computational Science and Discovery. 2. DOI: 10.1088/1749-4699/2/1/015001  1
2009 Fowler R, Adhianto L, De Supinski B, Fagan M, Gamblin T, Krentel M, Mellor-Crummey J, Schulz M, Tallent N. Frontiers of performance analysis on leadership-class systems Journal of Physics: Conference Series. 180. DOI: 10.1088/1742-6596/180/1/012041  1
2009 De Supinski BR, Alam S, Bailey DH, Carrington L, Daley C, Dubey A, Gamblin T, Gunter D, Hovland PD, Jagode H, Karavanic K, Marin G, Mellor-Crummey J, Moore S, Norris B, et al. Modeling the Office of Science ten year facilities plan: The PERI Architecture Tiger Team Journal of Physics: Conference Series. 180. DOI: 10.1088/1742-6596/180/1/012039  1
2008 Marin G, Mellor-Crummey J. Pinpointing and exploiting opportunities for enhancing data reuse Ispass 2008 - Ieee International Symposium On Performance Analysis of Systems and Software. 115-126. DOI: 10.1109/ISPASS.2008.4510744  1
2008 Tallent N, Mellor-Crummey J, Adhianto L, Fagan M, Krentel M. HPCToolkit: Performance tools for scientific computing Journal of Physics: Conference Series. 125. DOI: 10.1088/1742-6596/125/1/012088  1
2008 Marin G, Jin G, Mellor-Crummey J. Managing locality in grand challenge applications: A case study of the gyrokinetic toroidal code Journal of Physics: Conference Series. 125. DOI: 10.1088/1742-6596/125/1/012087  1
2007 Coarfa C, Mellor-Crummey J, Froyd N, Dotsenko Y. Scalability analysis of SPMD codes using expectations Proceedings of the International Conference On Supercomputing. 13-22. DOI: 10.1145/1274971.1274976  1
2007 Marin G, Mellor-Crummey J. Application insight through performance modeling Conference Proceedings of the Ieee International Performance, Computing, and Communications Conference. 65-74. DOI: 10.1109/PCCC.2007.358880  1
2007 Mellor-Crummey J. Harnessing the power of emerging petascale platforms Journal of Physics: Conference Series. 78. DOI: 10.1088/1742-6596/78/1/012048  1
2006 Dotsenko Y, Coarfa C, Nakhleh L, Mellor-Crummey J, Roshan U. PRec-I-DCM3: a parallel framework for fast and accurate large-scale phylogeny reconstruction. International Journal of Bioinformatics Research and Applications. 2: 407-19. PMID 18048181 DOI: 10.1504/Ijbra.2006.011039  1
2006 Qasem A, Kennedy K, Mellor-Crummey J. Automatic tuning of whole applications using direct search and a performance-based transformation system Journal of Supercomputing. 36: 183-196. DOI: 10.1007/S11227-006-7957-2  1
2006 Coarfa C, Dotsenko Y, Mellor-Crummey J. Experiences with Sweep3D implementations in Co-array Fortran Journal of Supercomputing. 36: 101-121. DOI: 10.1007/S11227-006-7952-7  1
2006 Froyd N, Tallent N, Mellor-Crummey J, Fowler R. Call path profiling for unmodified, optimized binaries Proceedings of the Gcc Developers' Summit 2006. 21-35.  1
2005 Nakhleh L, Jin G, Zhao F, Mellor-Crummey J. Reconstructing phylogenetic networks using maximum parsimony. Proceedings / Ieee Computational Systems Bioinformatics Conference, Csb. Ieee Computational Systems Bioinformatics Conference. 93-102. PMID 16447967 DOI: 10.1109/CSB.2005.47  1
2005 Jin G, Mellor-Crummey J. Improving performance by reducing the memory footprint of scientific applications International Journal of High Performance Computing Applications. 19: 433-451. DOI: 10.1177/1094342005056138  1
2005 Strout MM, Mellor-Crummey J, Hovland P. Representation-independent program analysis Acm Sigplan/Sigsoft Workshop On Program Analysis For Software Tools and Engineering. 67-74. DOI: 10.1145/1108792.1108810  1
2005 Froyd N, Mellor-Crummey J, Fowler R. Low-overhead call path profiling of unmodified, optimized code Proceedings of the International Conference On Supercomputing. 81-90. DOI: 10.1145/1088149.1088161  1
2005 Coarfa C, Dotsenko Y, Mellor-Crummey J, Cantonnet F, El-Ghazawi T, Mohanti A, Yao Y, Chavarría-Miranda D. An evaluation of global address space languages: Co-array Fortran and Unified Parallel C Proceedings of the Acm Sigplan Symposium On Principles and Practice of Parallel Programming, Ppopp. 36-47. DOI: 10.1145/1065944.1065950  1
2005 Jin G, Mellor-Crummey J. SFCGen: A framework for efficient generation of multi-dimensional space-filling curves by recursion Acm Transactions On Mathematical Software. 31: 120-148. DOI: 10.1145/1055531.1055537  1
2005 Kennedy K, Broom B, Chauhan A, Fowler RJ, Garvin J, Koelbel C, Mccosh C, Mellor-Crummey J. Telescoping languages: A system for automatic generation of domain languages Proceedings of the Ieee. 93: 387-408. DOI: 10.1109/JPROC.2004.840447  1
2005 Chavarría-Miranda D, Jin G, Mellor-Crummey J. COTS clusters vs. the earth simulator: An application study using IMPACT-3D Proceedings - 19th Ieee International Parallel and Distributed Processing Symposium, Ipdps 2005. 2005. DOI: 10.1109/IPDPS.2005.156  1
2005 Coarfa C, Dotsenko Y, Mellor-Crummey J, Nakhleh L, Roshan U. PRec-I-DCM3: A parallel framework for fast and accurate large scale phylogeny reconstruction Proceedings of the International Conference On Parallel and Distributed Systems - Icpads. 2: 346-350. DOI: 10.1109/ICPADS.2005.240  1
2005 Mandal A, Kennedy K, Koelbel C, Marin G, Mellor-Crummey J, Liu B, Johnsson L. Scheduling strategies for mapping application workflows onto the grid Proceedings of the Ieee International Symposium On High Performance Distributed Computing. 125-134.  1
2005 Chavarría-Miranda D, Mellor-Crummey J. Effective communication coalescing for data-parallel applications Proceedings of the Acm Sigplan Symposium On Principles and Practice of Parallel Programming, Ppopp. 13-25.  1
2005 Dotsenko Y, Coarfa C, Mellor-Crummey J, Chavarría-Miranda D. Experiences with Co-array Fortran on hardware shared memory platforms Lecture Notes in Computer Science. 3602: 332-347.  1
2005 Jin G, Mellor-Crummey J. Space-filling curve generation: A table-based approach Proceedings of the 2005 International Conference On Algorithmic Mathematics and Computer Science, Amcs'05. 40-46.  0.36
2004 Dotsenko Y, Coarfa C, Mellor-Crummey J. A multi-platform Co-array Fortran compiler Parallel Architectures and Compilation Techniques - Conference Proceedings, Pact. 29-40. DOI: 10.1109/PACT.2004.1342539  1
2004 Cooper K, Dasgupta A, Kennedy K, Koelbel C, Mandal A, Marin G, Mazina M, Mellor-Crummey J, Berman F, Casanova H, Chien A, Dail H, Liu X, Olugbile A, Sievert O, et al. New grid scheduling and rescheduling methods in the GrADS project Proceedings - International Parallel and Distributed Processing Symposium, Ipdps 2004 (Abstracts and Cd-Rom). 18: 2775-2782. DOI: 10.1007/S10766-005-3584-4  0.88
2004 Marin G, Mellor-Crummey J. Cross-architecture performance predictions for scientific applications using parameterized models Performance Evaluation Review. 32: 2-13.  1
2004 Coarfa C, Dotsenko Y, Eckhardt J, Mellor-Crummey J. Co-array Fortran performance and potential: An NPB experimental study Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2958: 177-193.  1
2003 Darte A, Mellor-Crummey J, Fowler R, Chavarría-Miranda D. Generalized multipartitioning of multi-dimensional arrays for parallelizing line-sweep computations Journal of Parallel and Distributed Computing. 63: 887-911. DOI: 10.1016/S0743-7315(03)00103-5  1
2002 Chavarría-Miranda D, Mellor-Crummey J. An evaluation of data-parallel compiler support for line-sweep applications Parallel Architectures and Compilation Techniques - Conference Proceedings, Pact. 2002: 7-17. DOI: 10.1109/PACT.2002.1105969  1
2002 Kennedy K, Mazina M, Mellor-Crummey J, Cooper K, Torczon L, Berman F, Chien A, Dail H, Sievert O, Angulo D, Foster I, Aydt R, Reed D, Gannon D, Dongarra J, et al. Toward a framework for preparing and executing adaptive grid programs Proceedings - International Parallel and Distributed Processing Symposium, Ipdps 2002. 171. DOI: 10.1109/IPDPS.2002.1016570  1
2002 Darte A, Chavarría-Miranda D, Fowler R, Mellor-Crummey J. Generalized multipartitioning for multi-dimensional arrays Proceedings - International Parallel and Distributed Processing Symposium, Ipdps 2002. 246-255. DOI: 10.1109/IPDPS.2002.1015501  1
2002 Mellor-Crummey J, Adve V, Broom B, Chavarría-Miranda D, Fowler R, Jin G, Kennedy K, Yi Q. Advanced optimization strategies in the Rice dHPF compiler Concurrency Computation Practice and Experience. 14: 741-767. DOI: 10.1002/Cpe.647  1
2002 Jin G, Mellor-Crummey J. Experiences tuning SMG98 - A semicoarsening multigrid benchmark based on the hypre library Proceedings of the International Conference On Supercomputing. 305-314.  1
2001 Mellor-Crummey J, Whalley D, Kennedy K. Improving memory hierarchy performance for irregular applications using data and computation reorderings International Journal of Parallel Programming. 29: 217-247.  1
2001 Mellor-Crummey J, Fowler R, Whalley D. On providing useful information for analyzing and tuning applications Performance Evaluation Review. 29: 332-333.  1
2001 Chavarría-Miranda D, Mellor-Crummey J, Sarang T. Data-parallel compiler support for multipartitioning Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2150: 241-253.  1
2000 Chavarría-Miranda D, Mellor-Crummey J. Toward compiler support for scalable parallelism using multipartitioning Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 1915: 272-284. DOI: 10.1007/3-540-40889-4_21  0.76
2000 Zhang K, Mellor-Crummey J, Fowler RJ. Compilation and runtime optimizations for software distributed shared memory Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 1915: 182-191. DOI: 10.1007/3-540-40889-4_14  1
1999 McCurdy C, Mellor-Crummey J. An evaluation of computing paradigms for N-body simulations on distributed memory architectures Sigplan Notices (Acm Special Interest Group On Programming Languages). 34: 25-36.  1
1998 Adve V, Mellor-Crummey J. Using Integer Sets for Data-Parallel Program Analysis and Optimization Sigplan Notices (Acm Special Interest Group On Programming Languages). 33: 186-198.  1
1997 Roth G, Mellor-Crummey J, Kennedy K, Brickner RG. Compiling stencils in High Performance Fortran Proceedings of the International Conference On Supercomputing. DOI: 10.1145/509593.509605  1
1994 Adve V, Tseng CW, Carle A, Granston E, Hiranandani S, Kennedy K, Koelbel C, Kremer U, Mellor-Crummey J, Warren S. Requirements for Data-Parallel Programming Environments Ieee Parallel and Distributed Technology. 2: 48-58. DOI: 10.1109/M-Pdt.1994.329801  1
1994 Scott ML, Mellor-Crummey JM. Fast, contention-free combining tree barriers for shared-memory multiprocessors International Journal of Parallel Programming. 22: 449-481. DOI: 10.1007/BF02577741  1
1993 Mellor-Crummey J. Compile-time Support for Efficient Data Race Detection in Shared-Memory Parallel Programs Acm Sigplan Notices. 28: 129-139. DOI: 10.1145/174267.171370  1
1993 Cooper KD, Kennedy K, Mckinley KS, Mellor-Crummey JM, Torczon L, Hall MW, Hood RT, Warren SK. The ParaScope Parallel Programming Environment Proceedings of the Ieee. 81: 244-263. DOI: 10.1109/5.214549  1
1991 Mellor-Crummey JM, Scott ML. Scalable Reader-Writer Synchronization for Shared-Memory Multiprocessors Acm Sigplan Notices. 26: 106-113. DOI: 10.1145/109626.109637  1
1991 Mellor-Crummey JM, Scott ML. Synchronization without Contention Acm Sigplan Notices. 26: 269-278. DOI: 10.1145/106973.106999  1
1991 Mellor-Crummey JM, Scott ML. Algorithms for Scalable Synchronization on Shared-Memory Multiprocessors Acm Transactions On Computer Systems (Tocs). 9: 21-65. DOI: 10.1145/103727.103729  1
1990 Leblanc TJ, Mellor-Crummey JM, Fowler RJ. Analyzing parallel program executions using multiple views Journal of Parallel and Distributed Computing. 9: 203-217. DOI: 10.1016/0743-7315(90)90046-R  1
1989 Fowler RJ, LeBlanc TJ, Mellor-Crummey JM. An Integrated Approach to Parallel Program Debugging and Performance Analysis on Large-Scale Multiprocessors Acm Sigplan Notices. 24: 163-173. DOI: 10.1145/69215.69231  1
1989 Mellor-Crummey JM, LeBlanc TJ. Software instruction counter . 78-86.  1
1987 Leblanc TJ, Mellor-Crummey JM. Debugging Parallel Programs with Instant Replay Ieee Transactions On Computers. 471-482. DOI: 10.1109/TC.1987.1676929  1