Jagannathan Ramanujam - Publications

Affiliations: 
Louisiana State University, Baton Rouge, LA, United States 
Area:
Computer Science

152 high-probability publications. We are testing a new system for linking publications to authors. You can help! If you notice any inaccuracies, please sign in and mark papers as correct or incorrect matches. If you identify any major omissions or other inaccuracies in the publication list, please let us know.

Year Citation  Score
2016 Fang Y, Ding Y, Feinstein WP, Koppelman DM, Moreno J, Jarrell M, Ramanujam J, Brylinski M. GeauxDock: Accelerating Structure-Based Virtual Screening with Heterogeneous Computing. Plos One. 11: e0158898. PMID 27420300 DOI: 10.1371/journal.pone.0158898  1
2016 Hong C, Bao W, Cohen A, Krishnamoorthy S, Pouchet LN, Rastello F, Ramanujam J, Sadayappan P. Effective padding of multidimensional arrays to avoid cache conflict misses Proceedings of the Acm Sigplan Conference On Programming Language Design and Implementation (Pldi). 13: 129-144. DOI: 10.1145/2908080.2908123  1
2015 Ding Y, Fang Y, Feinstein WP, Ramanujam J, Koppelman DM, Moreno J, Brylinski M, Jarrell M. GeauxDock: A novel approach for mixed-resolution ligand docking using a descriptor-based force field. Journal of Computational Chemistry. PMID 26250822 DOI: 10.1002/jcc.24031  1
2015 Rawat P, Kong M, Henretty T, Holewinski J, Stock K, Pouchet LN, Ramanujam J, Rountev A, Sadayappan P. SDSLc: A multi-target domain-specific compiler for stencil computations Proceedings of Wolfhpc 2015: 5th International Workshop On Domain-Specific Languages and High-Level Frameworks For High Performance Computing - Held in Conjunction With Sc 2015: the International Conference For High Performance Computing, Networking, Storage and Analysis. DOI: 10.1145/2830018.2830025  1
2015 Grosser T, Ramanujam J, Pouchet LN, Sadayappan P, Pop S. Optimistic delinearization of parametrically sized arrays Proceedings of the International Conference On Supercomputing. 2015: 351-360. DOI: 10.1145/2751205.2751248  1
2015 Ravishankar M, Dathathri R, Elango V, Pouchet LN, Ramanujam J, Rountev A, Sadayappan P. Distributed memory code generation for mixed irregular/regular computations Proceedings of the Acm Sigplan Symposium On Principles and Practice of Parallel Programming, Ppopp. 2015: 65-75. DOI: 10.1145/2688500.2688515  1
2015 Elango V, Rastello F, Pouchet LN, Ramanujam J, Sadayappan P. On characterizing the data access complexity of programs Acm Sigplan Notices. 50: 567-580. DOI: 10.1145/2676726.2677010  1
2014 Elango V, Sedaghati N, Rastello F, Pouchet LN, Ramanujam J, Teodorescu R, Sadayappan P. On using the roofline model with lower bounds on data movement Acm Transactions On Architecture and Code Optimization. 11. DOI: 10.1145/2693656  1
2014 Luporini F, Varbanescu AL, Rathgeber F, Bercea GT, Ramanujam J, Ham DA, Kelly PHJ. Cross-loop optimization of arithmetic intensity for finite element local assembly Acm Transactions On Architecture and Code Optimization. 11. DOI: 10.1145/2687415  1
2014 Elango V, Rastello F, Pouchet LN, Ramanujam J, Sadayappan P. On characterizing the data movement complexity of computational DAGs for parallel execution Annual Acm Symposium On Parallelism in Algorithms and Architectures. 296-306. DOI: 10.1145/2612669.2612694  1
2014 Stock K, Kong M, Grosser T, Pouchet LN, Rastello F, Ramanujam J, Sadayappan P. A Framework for Enhancing Data Reuse via Associative Reordering Acm Sigplan Notices. 49: 65-76. DOI: 10.1145/2594291.2594342  1
2014 Strout MM, Luporini F, Krieger CD, Bertolli C, Bercea GT, Olschanowsky C, Ramanujam J, Kelly PHJ. Generalizing run-time tiling with the loop chain abstraction Proceedings of the International Parallel and Distributed Processing Symposium, Ipdps. 1136-1145. DOI: 10.1109/IPDPS.2014.118  1
2014 Krishnamoorthy S, Ramanujam J, Sadayappan P. Introduction to the JPDC special issue on domain-specific languages and high-level frameworks for high-performance computing Journal of Parallel and Distributed Computing. 74: 3175. DOI: 10.1016/j.jpdc.2014.09.011  1
2014 Fang Y, Feng S, Tam KM, Yun Z, Moreno J, Ramanujam J, Jarrell M. Parallel tempering simulation of the three-dimensional Edwards-Anderson model with compact asynchronous multispin coding on GPU Computer Physics Communications. 185: 2467-2478. DOI: 10.1016/j.cpc.2014.05.020  1
2014 Yun Z, Lei Z, Allen G, Katz DS, Ramanujam J. DA-TC: A novel application execution model in multicluster systems Cluster Computing. 17: 371-387. DOI: 10.1007/s10586-012-0228-5  1
2014 Konstantinidis A, Kelly PHJ, Ramanujam J, Sadayappan P. Parametric GPU code generation for affine loop programs Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 8664: 136-151. DOI: 10.1007/978-3-319-09967-5_8  1
2013 Tam KM, Fotso H, Yang SX, Lee TW, Moreno J, Ramanujam J, Jarrell M. Solving the parquet equations for the Hubbard model beyond weak coupling. Physical Review. E, Statistical, Nonlinear, and Soft Matter Physics. 87: 013311. PMID 23410464 DOI: 10.1103/PhysRevE.87.013311  1
2013 Tavarageri S, Ramanujam J, Sadayappan P. Adaptive parallel tiled code generation and accelerated auto-tuning International Journal of High Performance Computing Applications. 27: 412-425. DOI: 10.1177/1094342013493939  1
2013 Fauzia N, Elango V, Ravishankar M, Ramanujam J, Rastello F, Rountev A, Pouchet LN, Sadayappan P. Beyond reuse distance analysis: Dynamic analysis for characterization of data locality potential Transactions On Architecture and Code Optimization. 10. DOI: 10.1145/2555289.2555309  1
2013 Henretty T, Veras R, Franchetti F, Pouchet LN, Ramanujam J, Sadayappan P. A stencil compiler for short-vector SIMD architectures Proceedings of the International Conference On Supercomputing. 13-24. DOI: 10.1145/2464996.2467268  1
2013 Grosser T, Cohen A, Kelly PHJ, Ramanujam J, Sadayappan P, Verdoolaege S. Split tiling for GPUs: Automatic parallelization using trapezoidal tiles Acm International Conference Proceeding Series. 24-31. DOI: 10.1145/2458523.2458526  1
2012 Salamy H, Ramanujam J. An ILP solution to address code generation for embedded applications on digital signal processors Acm Transactions On Design Automation of Electronic Systems. 17. DOI: 10.1145/2209291.2209301  1
2012 Salamy H, Ramanujam J. Storage optimization through offset assignment with variable coalescing Transactions On Embedded Computing Systems. 11. DOI: 10.1145/2180887.2180893  1
2012 Salamy H, Ramanujam J. Code size reduction for array intensive applications on digital signal processors Journal of Circuits, Systems and Computers. 21. DOI: 10.1142/S0218126612500156  1
2012 Salamy H, Ramanujam J. An effective solution to task scheduling and memory partitioning for multiprocessor system-on-chip Ieee Transactions On Computer-Aided Design of Integrated Circuits and Systems. 31: 717-725. DOI: 10.1109/TCAD.2011.2181848  1
2012 Ravishankar M, Eisenlohr J, Pouchet LN, Ramanujam J, Rountev A, Sadayappan P. Code generation for parallel execution of a class of irregular loops on distributed memory systems International Conference For High Performance Computing, Networking, Storage and Analysis, Sc. DOI: 10.1109/SC.2012.30  1
2012 Lu Q, Gao X, Krishnamoorthy S, Baumgartner G, Ramanujam J, Sadayappan P. Empirical performance model-driven data layout optimization and library call selection for tensor contraction expressions Journal of Parallel and Distributed Computing. 72: 338-352. DOI: 10.1016/j.jpdc.2011.09.006  1
2012 Shirako J, Sharma K, Fauzia N, Pouchet LN, Ramanujam J, Sadayappan P, Sarkar V. Analytical bounds for optimal tile size selection Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 7210: 101-121. DOI: 10.1007/978-3-642-28652-0_6  1
2012 Sadayappan P, Ramanujam J. Chairs' welcome Acm Sigplan Notices. 47.  1
2011 Tavarageri S, Pouchet LN, Ramanujam J, Rountev A, Sadayappan P. Dynamic selection of tile sizes 18th International Conference On High Performance Computing, Hipc 2011. DOI: 10.1109/HiPC.2011.6152742  1
2011 Henretty T, Stock K, Pouchet LN, Franchetti F, Ramanujam J, Sadayappan P. Data layout transformation for stencil computations on short-vector SIMD architectures Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 6601: 225-245. DOI: 10.1007/978-3-642-19861-8_13  1
2010 Pouchet LN, Bondhugula U, Bastoul C, Cohen A, Ramanujam J, Sadayappan P, Vasilache N. Loop transformations: Convexity, pruning and optimization Conference Record of the Annual Acm Symposium On Principles of Programming Languages. 549-561. DOI: 10.1145/1926385.1926449  1
2010 Baskaran MM, Hartono A, Tavarageri S, Henretty T, Ramanujam J, Sadayappan P. Parameterized tiling revisited Proceedings of the 2010 Cgo - the 8th International Symposium On Code Generation and Optimization. 200-209. DOI: 10.1145/1772954.1772983  1
2010 Pouchet LN, Bondhugula U, Bastoul C, Cohen A, Ramanujam J, Sadayappan P. Combined iterative and model-driven optimization in an automatic parallelization framework 2010 Acm/Ieee International Conference For High Performance Computing, Networking, Storage and Analysis, Sc 2010. DOI: 10.1109/SC.2010.14  1
2010 Hartono A, Baskaran MM, Ramanujam J, Sadayappan P. DynTile: Parametric tiled loop generation for parallel execution on multicore processors Proceedings of the 2010 Ieee International Symposium On Parallel and Distributed Processing, Ipdps 2010. DOI: 10.1109/IPDPS.2010.5470459  1
2010 Wang TC, Ramanujam J. A dynamic heuristic algorithm for offset assignment Ics 2010 - International Computer Symposium. 895-900. DOI: 10.1109/COMPSYM.2010.5685384  1
2010 Baskaran MM, Ramanujam J, Sadayappan P. Automatic C-to-CUDA code generation for affine programs Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 6011: 244-263. DOI: 10.1007/978-3-642-11970-5_14  1
2009 Hartono A, Lu Q, Henretty T, Krishnamoorthy S, Zhang H, Baumgartner G, Bernholdt DE, Nooijen M, Pitzer R, Ramanujam J, Sadayappan P. Performance optimization of tensor contraction expressions for many-body methods in quantum chemistry. The Journal of Physical Chemistry. A. 113: 12715-23. PMID 19888780 DOI: 10.1021/jp9051215  1
2009 Hartono A, Baskaran MM, Bastoul C, Cohen A, Krishnamoorthy S, Norris B, Ramanujam J, Sadayappan P. Parametric multi-level tiling of imperfectly nested loops Proceedings of the International Conference On Supercomputing. 147-157. DOI: 10.1145/1542275.1542301  1
2009 Sankaran R, Ullmer B, Ramanujam J, Kallakuri K, Jandhyala S, Toole C, Laan C. Decoupling interaction hardware design using libraries of reusable electronics Proceedings of the 3rd International Conference On Tangible and Embedded Interaction, Tei'09. 331-337. DOI: 10.1145/1517664.1517732  1
2009 Baskaran MM, Vydyanathan N, Bondhugula UK, Ramanujam J, Rountev A, Sadayappan P. Compiler-assisted dynamic scheduling for effective parallelization of loop nests on multicore processors Acm Sigplan Notices. 44: 219-228. DOI: 10.1145/1504176.1504209  1
2009 Lu Q, Alias C, Bondhugula U, Henretty T, Krishnamoorthy S, Ramanujam J, Rountev A, Sadayappan P, Chen Y, Ngai TF, Lin H. Data layout transformation for enhancing data locality on NUCA chip multiprocessors Parallel Architectures and Compilation Techniques - Conference Proceedings, Pact. 348-357. DOI: 10.1109/PACT.2009.36  1
2009 Yun Z, Lei Z, Allen G, Katz DS, Kosar T, Jha S, Ramanujam J. An innovative application execution toolkit for multicluster grids Proceedings - Ieee International Conference On Cluster Computing, Iccc. DOI: 10.1109/CLUSTR.2009.5289121  1
2009 Salamy H, Ramanujam J. A framework for task scheduling and memory partitioning for multi-processor system-on-chip Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 5409: 263-277. DOI: 10.1007/978-3-540-92990-1_20  1
2009 Baskaran MM, Vydyanathan N, Bondhugula UK, Ramanujam J, Rountev A, Sadayappan P. Compiler-assisted dynamic scheduling for effective parallelization of loop nests on multicore processors Acm Sigplan Notices. 44: 219-228.  1
2008 Bondhugula U, Hartono A, Ramanujam J, Sadayappan P. A practical automatic polyhedral parallelizer and locality optimizer Acm Sigplan Notices. 43: 101-113. DOI: 10.1145/1375581.1375595  1
2008 Baskaran MM, Bondhugula U, Krishnamoorthy S, Ramanujam J, Rountev A, Sadayappan P. A compiler framework for optimization of affine loop nests for GPGPUs Proceedings of the International Conference On Supercomputing. 225-234. DOI: 10.1145/1375527.1375562  1
2008 Bondhugula U, Baskaran M, Hartono A, Krishnamoorthy S, Ramanujam J, Rountev A, Sadayappan P. Towards effective automatic parallelization for multicore systems Ipdps Miami 2008 - Proceedings of the 22nd Ieee International Parallel and Distributed Processing Symposium, Program and Cd-Rom. DOI: 10.1109/IPDPS.2008.4536401  1
2008 Hong J, Ramanujam J. Scheduling DAGs for fixed-point DSP processors by using worm partitions Proceedings of the International Conference On Embedded Software and Systems, Icess 2008. 567-574. DOI: 10.1109/ICESS.2008.89  1
2008 Hong J, Ramanujam J. Address register allocation in digital signal processors Proceedings of the International Conference On Embedded Software and Systems, Icess 2008. 331-337. DOI: 10.1109/ICESS.2008.88  1
2008 Salamy H, Ramanujam J. Storage optimization through code size reduction for digital signal processors Proceedings of the 2008 Ieee/Acm/Ifip Workshop On Embedded Systems For Real-Time Multimedia, Estimedia 2008. 107-112. DOI: 10.1109/ESTMED.2008.4697006  1
2008 Salamy H, Ramanujam J. Optimal address register allocation for arrays in DSP applications Proceedings of the 2008 Ieee/Acm/Ifip Workshop On Embedded Systems For Real-Time Multimedia, Estimedia 2008. 67-72. DOI: 10.1109/ESTMED.2008.4696998  1
2008 Bondhugula U, Baskaran M, Krishnamoorthy S, Ramanujam J, Rountev A, Sadayappan P. Automatic transformations for communication-minimized parallelization and locality optimization in the polyhedral model Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 4959: 132-146. DOI: 10.1007/978-3-540-78791-4_9  1
2008 Baskaran MM, Ramanujam J, Bondhugula U, Rountev A, Krishnamoorthy S, Sadayappan P. Automatic data movement and computation mapping for multi-level parallel architectures with explicitly managed memories Proceedings of the Acm Sigplan Symposium On Principles and Practice of Parallel Programming, Ppopp. 111-122.  1
2007 Krishnamoorthy S, Baskaran M, Bondhugula U, Ramanujam J, Rountev A, Sadayappan P. Effective automatic parallelization of stencil computations Proceedings of the Acm Sigplan Conference On Programming Language Design and Implementation (Pldi). 235-244. DOI: 10.1145/1250734.1250761  1
2007 Bondhugula U, Ramanujam J, Sadayappan P. Automatic mapping of nested loops to FPGAS Proceedings of the Acm Sigplan Symposium On Principles and Practice of Parallel Programming, Ppopp. 101-111. DOI: 10.1145/1229428.1229446  1
2007 Pinnepalli S, Hong J, Ramanujam J, Carver DL. Code size optimization for embedded processors using commutative transformations Proceedings - 13th Ieee International Conference On Embedded and Real-Time Computing Systems and Applications, Rtcsa 2007. 409-416. DOI: 10.1109/RTCSA.2007.28  1
2007 Gao X, Krishnamoorthy S, Sahoo SK, Lam CC, Baumgartner G, Ramanujam J, Sadayappan P. Efficient search-space pruning for integrated fusion and tiling transformations Concurrency Computation Practice and Experience. 19: 2425-2443. DOI: 10.1002/cpe.1182  1
2007 Hong J, Ramanujam J. Memory offset assignment for DSPs Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 4523: 80-87.  1
2007 Salamy H, Ramanujam J. An effective heuristic for simple offset assignment with variable coalescing Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 4382: 158-172.  1
2006 Ramanujam J, Hong J, Kandemir M, Narayan A, Agarwal A. Estimating and reducing the memory requirements of signal processing codes for embedded systems Ieee Transactions On Signal Processing. 54: 286-294. DOI: 10.1109/TSP.2005.855086  1
2006 Allam A, Ramanujam J, Baumgartner G, Sadayappan P. Memory minimization for tensor contractions using integer linear programming 20th International Parallel and Distributed Processing Symposium, Ipdps 2006. 2006. DOI: 10.1109/IPDPS.2006.1639717  1
2006 Auer AA, Baumgartner G, Bernholdt DE, Bibireata A, Choppella V, Cociorva D, Gao X, Harrison R, Krishnamoorthy S, Krishnan S, Lam CC, Lu Q, Nooijen M, Pitzer R, Ramanujam J, et al. Automatic code generation for many-body electronic structure methods: The tensor contraction engine Molecular Physics. 104: 211-228. DOI: 10.1080/00268970500275780  1
2006 Krishnan S, Krishnamoorthy S, Baumgartner G, Lam CC, Ramanujam J, Sadayappan P, Choppella V. Efficient synthesis of out-of-core algorithms using a nonlinear optimization solver Journal of Parallel and Distributed Computing. 66: 659-673. DOI: 10.1016/j.jpdc.2005.06.017  1
2006 Hartono A, Lu Q, Gao X, Krishnamoorthy S, Nooijen M, Baumgartner G, Bernholdt DE, Choppella V, Pitzer RM, Ramanujam J, Rountev A, Sadayappan P. Identifying cost-effective common subexpressions to reduce operation count in tensor contraction evaluations Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 3991: 267-275. DOI: 10.1007/11758501_39  1
2006 Allam AK, Ramanujam J. Modified force-directed scheduling for peak and average power optimization using multiple supply-voltages 2006 Ieee International Conference On Integrated Circuit Design and Technology, Icicdt'06 1
2006 Allam AK, Ramanujam J. Simultaneous peak and average power optimization in synchronous sequential designs using retiming and multiple supply voltages 2006 Ieee International Conference On Integrated Circuit Design and Technology, Icicdt'06 1
2006 Kandemir M, Ramanujam J, Sezer U. Improving the energy behavior of block buffering using compiler optimizations Acm Transactions On Design Automation of Electronic Systems. 11: 228-250.  1
2005 Gao X, Sahoo SK, Lam CC, Ramanujam J, Lu Q, Baumgartner G, Sadayappan P. Performance modeling and optimization of parallel out-of-core tensor contractions Proceedings of the Acm Sigplan Symposium On Principles and Practice of Parallel Programming, Ppopp. 266-276. DOI: 10.1145/1065944.1065980  1
2005 Baumgartner G, Auer A, Bernholdt DE, Bibireata A, Choppella V, Cociorva D, Gao X, Harrison RJ, Hirata S, Krishnamoorthy S, Krishnan S, Lam CC, Lu Q, Nooijen M, Pitzer RM, ... Ramanujam J, et al. Synthesis of high-performance parallel programs for a class of Ab Initio quantum chemistry models Proceedings of the Ieee. 93: 276-291. DOI: 10.1109/JPROC.2004.840311  1
2005 Cociorva D, Baumgartner G, Lam CC, Sadayappan P, Ramanujam J. Memory-constrained communication minimization for a class of array computations Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2481: 1-15. DOI: 10.1007/11596110_1  1
2005 Lu Q, Gao X, Krishnamoorthy S, Baumgartner G, Ramanujam J, Sadayappan P. Empirical performance-model driven data layout optimization Lecture Notes in Computer Science. 3602: 72-86.  1
2005 Hartono A, Sibiryakov A, Nooijen M, Baumgartner G, Bernholdt DE, Hirata S, Lam CC, Pitzer RM, Ramanujam J, Sadayappan P. Automated operation minimization of tensor contraction expressions in electronic structure calculations Lecture Notes in Computer Science. 3514: 155-164.  1
2004 Kandemir M, Kadayif I, Choudhary A, Ramanujam J, Kolcu I. Compiler-Directed Scratch Pad Memory Optimization for Embedded Multiprocessors Ieee Transactions On Very Large Scale Integration (Vlsi) Systems. 12: 281-287. DOI: 10.1109/TVLSI.2004.824299  1
2004 Kandemir M, Ramanujam J, Irwin MJ, Vijaykrishnan N, Kadayif I, Parikh A. A compiler-based approach for dynamically managing scratch-pad memories in embedded systems Ieee Transactions On Computer-Aided Design of Integrated Circuits and Systems. 23: 243-260. DOI: 10.1109/TCAD.2003.822123  1
2004 Krishnan S, Krishnamoorthy S, Baumgartner G, Lam CC, Ramanujam J, Sadayappan P, Choppella V. Efficient synthesis of out-of-core algorithms using a nonlinear optimization solver Proceedings - International Parallel and Distributed Processing Symposium, Ipdps 2004 (Abstracts and Cd-Rom). 18: 471-480.  1
2004 Bibireata A, Krishnan S, Baumgartner G, Cociorva D, Lam CC, Sadayappan P, Ramanujam J, Bernholdt DE, Choppella V. Memory-constrained data locality optimization for tensor contractions Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2958: 93-108.  1
2003 Kandemir M, Choudhary A, Ramanujam J, Banerjee P. Reducing false sharing and improving spatial locality in a unified compilation framework Ieee Transactions On Parallel and Distributed Systems. 14: 337-354. DOI: 10.1109/TPDS.2003.1195407  1
2003 Cociorva D, Gao X, Krishnan S, Baumgartner G, Lam CC, Sadayappan P, Ramanujam J. Global communication optimization for tensor contraction expressions under memory constraints Proceedings - International Parallel and Distributed Processing Symposium, Ipdps 2003. DOI: 10.1109/IPDPS.2003.1213121  1
2003 Kandemir M, Irwin MJ, Chen G, Ramanujam J. Address register assignment for reducing code size Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2622: 273-289.  1
2003 Krishnan S, Krishnamoorthy S, Baumgartner G, Cociorva D, Lam CC, Sadayappan P, Ramanujam J, Bernholdt DE, Choppella V. Data locality optimization for synthesis of efficient out-of-core algorithms Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2913: 406-417.  1
2002 Baumgartner G, Bernholdt DE, Cociorva D, Harrison R, Lam CC, Nooijen M, Ramanujam J, Sadayappan P. A performance optimization framework for compilation of tensor contraction expressions into parallel Proceedings - International Parallel and Distributed Processing Symposium, Ipdps 2002. 106. DOI: 10.1109/IPDPS.2002.1016492  1
2002 Crosbie NE, Kandemir M, Kolcu I, Ramanujam J, Choudhary A. Strategies for improving data locality in embedded applications Proceedings - 7th Asia and South Pacific Design Automation Conference, 15th International Conference On Vlsi Design, Asp-Dac/Vlsi Design 2002. 631-636. DOI: 10.1109/ASPDAC.2002.995007  1
2002 Ramanujam J, Krishnamurthy S, Hong J, Kandemir M. Address code and arithmetic optimizations for embedded systems Proceedings - 7th Asia and South Pacific Design Automation Conference, 15th International Conference On Vlsi Design, Asp-Dac/Vlsi Design 2002. 619-624. DOI: 10.1109/ASPDAC.2002.995005  1
2002 Ramanujam J, Deshpande S, Hong J, Kandemir M. A heuristic for clock selection in high-level synthesis Proceedings - 7th Asia and South Pacific Design Automation Conference, 15th International Conference On Vlsi Design, Asp-Dac/Vlsi Design 2002. 414-419. DOI: 10.1109/ASPDAC.2002.994956  1
2002 Kandemir M, Choudhary A, Ramanujam J. An I/O-conscious tiling strategy for disk-resident data sets Journal of Supercomputing. 21: 257-284. DOI: 10.1023/A:1014156327748  1
2002 Kandemir M, Ramanujam J, Choudhary A. Exploiting shared scratch pad memory space in embedded multiprocessor systems Proceedings - Design Automation Conference. 219-224.  1
2002 Cociorva D, Baumgartner G, Lam CC, Sadayappan P, Ramanujam J, Nooijen M, Bernholdt DE, Harrison R. Space-time trade-off optimization for a class of electronic structure calculations Proceedings of the Acm Sigplan Conference On Programming Language Design and Implementation (Pldi). 177-186.  1
2001 Kandemir M, Banerjee P, Choudhary A, Ramanujam J, Ayguadé E. Static and dynamic locality optimizations using integer linear programming Ieee Transactions On Parallel and Distributed Systems. 12: 922-941. DOI: 10.1109/TPDS.2001.1184186  1
2001 Kandemir M, Ramanujam J, Choudhary A, Banerjee P. A layout-conscious iteration space transformation technique Ieee Transactions On Computers. 50: 1321-1336. DOI: 10.1109/TC.2001.970571  1
2001 Rele S, Jain V, Pande S, Ramanujam J. Compact and efficient code generation through program restructuring on limited memory embedded DSPs Ieee Transactions On Computer-Aided Design of Integrated Circuits and Systems. 20: 477-494. DOI: 10.1109/43.918207  1
2001 Atri S, Ramanujam J, Kandemir M. Improving offset assignment for embedded processors Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2017: 158-172. DOI: 10.1007/3-540-45574-4_11  1
2001 Narasimhan M, Ramanujam J. A fast approach to computing exact solutions to the Resource-Constrained Scheduling problem Acm Transactions On Design Automation of Electronic Systems. 6: 490-500.  1
2001 Kandemir M, Ramanujam J, Sezer U. Compiler support for block buffering Proceedings of the International Symposium On Low Power Electronics and Design, Digest of Technical Papers. 76-79.  1
2001 Ramanujam J, Hong J, Kandemir M, Narayan A. Reducing memory requirements of nested loops for embedded systems Proceedings - Design Automation Conference. 359-364.  1
2001 Kandemir M, Ramanujam J, Irwin MJ, Vijaykrishnan N, Kadayif I, Parikh A. Dynamic management of scratch-pad memory space Proceedings - Design Automation Conference. 690-695.  1
2001 Cociorva D, Wilkins JW, Lam C, Baumgartner G, Sadayappan P, Ramanujam J. Loop optimizations for a class of memory-constrained computations Proceedings of the International Conference On Supercomputing. 103-113.  1
2001 Kadayif I, Kandemir M, Vijaykrishnan N, Irwin MJ, Ramanujam J. Morphable cache architectures: Potential benefits Sigplan Notices (Acm Special Interest Group On Programming Languages). 36: 128-137.  1
2001 Cociorva D, Wilkins J, Baumgartner G, Sadayappan P, Ramanujam J, Nooijen M, Bernholdt D, Harrison R. Towards automatic synthesis of high-performance codes for electronic structure calculations: Data locality optimization Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2228: 237-248.  1
2000 Kandemir M, Choudhary A, Banerjee P, Ramanujam J, Shenoy N. Minimizing data and synchronization costs in one-way communication Ieee Transactions On Parallel and Distributed Systems. 11: 1232-1251. DOI: 10.1109/71.895791  1
2000 Kandemir M, Choudhary A, Ramanujam J, Kandaswamy MA. A unified framework for optimizing locality, parallelism, and communication in out-of-core computations Ieee Transactions On Parallel and Distributed Systems. 11: 648-668. DOI: 10.1109/71.877759  1
2000 Kandemir M, Ramanujam J. Data relation vectors: a new abstraction for data optimizations Parallel Architectures and Compilation Techniques - Conference Proceedings, Pact. 227-236. DOI: 10.1109/12.947000  1
2000 Kandemir M, Ramanujam J, Choudhary A. Compiler Algorithms for Optimizing Locality and Parallelism on Shared and Distributed-Memory Machines Journal of Parallel and Distributed Computing. 60: 924-965. DOI: 10.1006/jpdc.2000.1639  1
2000 Narasimhan M, Ramanujam J. On lower bounds for scheduling problems in high-level synthesis Proceedings - Design Automation Conference. 546-551.  1
2000 Jain V, Rele S, Pande S, Ramanujam J. Code restructuring for improving real time response through code speed, size trade-offs on limited memory embedded DSPs Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 1863: 459-463.  1
2000 Atri S, Ramanujam J, Kandemir M. Improving offset assignment on embedded processors using transformations Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 1970: 367-374.  1
1999 Kandemir M, Choudhary A, Shenoy N, Banerjee P, Ramanujam J. A linear algebra framework for automatic determination of optimal data layouts Ieee Transactions On Parallel and Distributed Systems. 10: 115-135. DOI: 10.1109/71.752779  1
1999 Kandemir M, Ramanujam J, Choudhary A. Improving cache locality by a combination of loop and data transformations Ieee Transactions On Computers. 48: 159-167. DOI: 10.1109/12.752657  1
1999 Kandemir M, Choudhary A, Ramanujam J, Banerjee P. On reducing false sharing while improving locality on shared memory multiprocessors Parallel Architectures and Compilation Techniques - Conference Proceedings, Pact. 203-211.  1
1999 Kandemir M, Choudhary A, Ramanujam J, Banerjee P. Graph based framework to detect optimal memory layouts for improving data locality Proceedings of the International Parallel Processing Symposium, Ipps. 738-743.  1
1999 Kandemir M, Choudhary A, Ramanujam J. Restructuring I/O-intensive computations for locality Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 1593: 1097-1106.  1
1999 Kandemir M, Choudhary A, Ramanujam J. I/O-conscious tiling for disk-resident data sets Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 1685: 430-439.  1
1999 Kandemir M, Banerjee P, Choudhary A, Ramanujam J, Ayguade E. Integer linear programming approach for optimizing cache locality Proceedings of the International Conference On Supercomputing. 500-509.  1
1999 Kandemir M, Choudhary A, Ramanujam J, Banerjee P. A Matrix-Based Approach to Global Locality Optimization Journal of Parallel and Distributed Computing. 58: 190-235.  1
1999 Kandemir M, Ramanujam J, Choudhary A, Banerjee P. A loop transformation algorithm based on explicit data layout representation for optimizing locality Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 1656: 34-50.  1
1999 Kandemir M, Banerjee P, Choudhary A, Ramanujam J, Shenoy N. A global communication optimization technique based on data-flow analysis and linear algebra Acm Transactions On Programming Languages and Systems. 21: 1251-1297.  1
1998 Kandemir M, Banerjee P, Choudhary A, Ramanujam J, Shenoy N. Generalized framework for global communication optimization Proceedings of the International Parallel Processing Symposium, Ipps. 69-73. DOI: 10.1109/IPPS.1998.669892  1
1998 Ramanujam J, Dutta S, Venkatachar A. Code generation for complex subscripts in data-parallel programs Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 1366: 49-63. DOI: 10.1007/BFb0032683  1
1998 Narasimhan M, Ramanujam J. Improving the computational performance of ILP-based problems Ieee/Acm International Conference On Computer-Aided Design, Digest of Technical Papers. 593-596.  1
1998 Kandemir M, Choudhary A, Ramanujam J, Banerjee P. Improving locality using loop and data transformations in an integrated framework Proceedings of the Annual International Symposium On Microarchitecture. 285-296.  1
1998 Sadayappan P, Ercal F, Ramanujam J. Partitioning graphs on message-passing machines by pairwise mincut Information Sciences. 111: 223-237.  1
1998 Kandemir M, Choudhary A, Ramanujam J. Improving locality in out-of-core computations using data layout transformations Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 1511: 359-366.  1
1998 Kandemir M, Choudhary A, Shenoy N, Banerjee P, Ramanujam J. Hyperplane based approach for optimizing spatial locality in loop nests Proceedings of the International Conference On Supercomputing. 69-76.  1
1998 Kandemir M, Choudhary A, Ramanujam J, Kandaswamy M. Locality optimization algorithms for compilation of out-of-core codes Journal of Information Science and Engineering. 14: 107-138.  1
1998 Kandemir M, Choudhary A, Ramanujam J, Bordawekar R. Compilation techniques for out-of-core parallel computations Parallel Computing. 24: 597-628.  1
1998 Kandemir M, Choudhary A, Ramanujam J, Shenoy N, Banerjee P. Enhancing spatial locality via data layout optimizations Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 1470: 422-434.  1
1997 Venkatachar A, Ramanujam J, Thirumalai A. Generalized overlap regions for communication optimization in data-parallel programs Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 1239: 404-419. DOI: 10.1007/BFb0017266  1
1997 Kandemir M, Ramanujam J, Choudhary A. Compiler algorithms for optimizing locality and parallelism on shared and distributed memory machines Parallel Architectures and Compilation Techniques - Conference Proceedings, Pact. 236-245.  1
1997 Kandemir M, Ramanujam J, Choudhary A. Compiler algorithm for optimizing locality in loop nests Proceedings of the International Conference On Supercomputing. 269-276.  1
1997 Kandemir M, Ramanujam J, Choudhary A. Improving the performance of out-of-core computations Proceedings of the International Conference On Parallel Processing. 128-136.  1
1997 Kandemir M, Ramanujam J, Choudhary A. Optimization of out-of-core computations using chain vectors Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 1300: 601-608.  1
1997 Kandemir M, Choudhary A, Ramanujam J, Kandaswamy M. Unified compiler algorithm for optimizing locality, parallelism and communication in out-of-core computations Proceedings of the Annual Workshop On I/O in Parallel and Distributed Systems, Iopads. 79-92.  1
1996 Goel AK, Ramanujam J. A neural architecture for a class of abduction problems. Ieee Transactions On Systems, Man, and Cybernetics. Part B, Cybernetics : a Publication of the Ieee Systems, Man, and Cybernetics Society. 26: 854-60. PMID 18263085 DOI: 10.1109/3477.544299  1
1996 Thakur R, Choudhary A, Ramanujam J. Efficient algorithms for array redistribution Ieee Transactions On Parallel and Distributed Systems. 7: 587-594. DOI: 10.1109/71.506697  1
1996 Bordawekar R, Choudhary A, Ramanujam J. Compilation and communication strategies for out-of-core programs on distributed memory machines Journal of Parallel and Distributed Computing. 38: 277-288. DOI: 10.1006/jpdc.1996.0148  1
1996 Thirumalai A, Ramanujam J. Efficient computation of address sequences in data parallel programs using closed forms for basis vectors Journal of Parallel and Distributed Computing. 38: 188-203. DOI: 10.1006/jpdc.1996.0140  1
1996 Thirumalai A, Ramanujam J. Fast address sequence generation for data-parallel programs using integer lattices Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 1033: 192-208.  1
1996 Bordawekar R, Choudhar A, Ramanujam J. A framework for integrated communication and I/O placement Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 1123: 541-552.  1
1996 Bordawekar R, Choudhary A, Ramanujam J. Automatic optimization of communication in compiling out-of-core stencil codes Proceedings of the International Conference On Supercomputing. 366-373.  1
1995 Ramanujam J, Sadayappan P. Mapping combinatorial optimization problems onto neural networks Information Sciences. 82: 239-255. DOI: 10.1016/0020-0255(94)00052-D  1
1995 Ramanujam J. Beyond unimodular transformations The Journal of Supercomputing. 9: 365-389. DOI: 10.1007/BF01206273  1
1995 Ramanujam J, Vasanthakumar S. Statement-level independent partitioning of uniform recurrences Ieee Symposium On Parallel and Distributed Processing - Proceedings. 229-233.  1
1995 Ramanujam J, Mathew A. Analysis of event synchronization in parallel programs Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 892: 300-315.  1
1995 Kaushik SD, Huang CH, Ramanujam J, Sadayappan P. Multi-phase array redistribution: modeling and evaluation Ieee Symposium On Parallel and Distributed Processing - Proceedings. 441-445.  1
1994 Ramanujam J. Optimal software pipelining of nested loops Proceedings of the International Conference On Parallel Processing. 335-342.  1
1992 Ramanujam J, Sadayappan P. Tiling multidimensional iteration spaces for multicomputers Journal of Parallel and Distributed Computing. 16: 108-120. DOI: 10.1016/0743-7315(92)90027-K  1
1991 Ramanujam J, Sadayappan P. Compile-Time Techniques for Data Distribution in Distributed Memory Machines Ieee Transactions On Parallel and Distributed Systems. 2: 472-482. DOI: 10.1109/71.97903  1
1990 Ercal F, Ramanujam J, Sadayappan P. Task allocation onto a hypercube by recursive mincut bipartitioning Journal of Parallel and Distributed Computing. 10: 35-44. DOI: 10.1016/0743-7315(90)90004-9  1
1990 Sadayappan P, Ercal F, Ramanujam J. Cluster partitioning approaches to mapping parallel programs onto a hypercube Parallel Computing. 13: 1-16. DOI: 10.1016/0167-8191(90)90115-P  1
1989 Ramanujam J, Sadayappan P. Methodology for parallelizing programs for multicomputers and complex memory multiprocessors . 637-646.  1
1988 Ramanujam J, Sadayappan P. Optimization by neural networks . 325-332.  1
1988 Goel A, Ramanujam J, Sadayappan P. Towards a 'neural' architecture for abductive reasoning . 681-688.  1
Show low-probability matches.