Lab Fellow, Computer Science
High-Performance Computing

Biography

Sriram Krishnamoorthy is a computer scientist and Laboratory Fellow in PNNL's High-Performance Computing group, and a research professor at Washington State University’s School of Electrical Engineering and Computer Science. His research focuses on parallel programming models, quantum computing, fault tolerance, and compile-time/runtime optimizations for high-performance computing. He has more than 100 peer-reviewed conference and journal publications and has received three Best Paper and two Best Student Paper Awards. In 2013, he received the U.S. Department of Energy (DOE) Early Career Award. That year, he also earned PNNL's Ronald L. Brodzinski Award for Early Career Exceptional Achievement. In 2008, he received The Ohio State University's Outstanding Researcher Award. Dr. Krishnamoorthy is a senior member of the Institute of Electrical and Electronics Engineers (IEEE) and the Association for Computing Machinery (ACM).

Research Interests

  • High-Performance Computing
  • Quantum Computing
  • Parallel Programming Models
  • Fault Tolerance
  • Runtime Systems
  • Loop Transformations

Education

  • PhD in Computer Science and Engineering, The Ohio State University
  • MS in Computer Science and Engineering, The Ohio State University
  • BE in Computer Science and Engineering, College of Engineering, Guindy (Chennai, India)

Affiliations and Professional Service

  • Senior Member, IEEE
  • Senior Member, ACM

Awards and Recognitions

  • Best Student Paper Award, Supercomputing Conference (SC) (2020)
  • Best Paper Finalist, SC (2020)
  • Best Paper Award, International Symposium on Networks-on-Chip (NOCS) (2019)
  • Best Paper Finalist, SC (2014)
  • Best Student Paper Finalist, SC (2014)
  • Best Student Paper Award, IEEE Cluster (2014)
  • Early Career Award, DOE (2013)
  • Ronald L. Brodzinski Award for Early Career Exceptional Achievement, PNNL (2013)
  • Member, Operating Systems and Runtime Technical Council, Advanced Scientific Computing Research (ASCR)/Advanced Simulation and Computing program (ASC) Exascale (2012)
  • Member, ASCR/ASC Resilience Technical Council (2012)
  • Outstanding Researcher Award, Computer Science and Engineering Department, Ohio State University (2008)
  • High-Performance Computing Challenge Award, as part of IBM X10 Team, SC (2007)
  • PhD Fellowship, IBM (2007)
  • Presidential Fellowship, Ohio State University (2007)
  • Best Paper Award, International Parallel and Distributed Processing Symposium (IPDPS) (2004)
  • Best Paper Award, International Conference on High Performance Computing (HiPC) (2003)

Publications

2021

  • Ghosh P., S. Krishnamoorthy, and A. Kalyanaraman. 2021. "PaKman: A Scalable Algorithm for Generating Genomic Contigs on Distributed Memory Machines." IEEE Transactions on Parallel and Distributed Systems 32, no. 5:1191-1209. PNNL-SA-157408. doi:10.1109/TPDS.2020.3043241

2020

  • Apra E., E.J. Bylaska, W.A. De Jong, N. Govind, K. Kowalski, T.P. Straatsma, and M. Valiev, et al. 2020. "NWChem: Past, Present, and Future." The Journal of Chemical Physics 152, no. 18:184102. PNNL-SA-151542. doi:10.1063/5.0004997
  • Briggs I., A. Das, M. Baranowski, V. Sharma, S. Krishnamoorthy, Z. Rakamaric, and G. Gopalakrishnan. 2020. "FailAmp: Relativization Transformation for Soft Error Detection in Structured Address Generation." ACM Transactions on Architecture and Code Optimization 16, no. 4:Article No. 50. PNNL-SA-148816. doi:10.1145/3369381
  • Das A., S. Krishnamoorthy, I. Briggs, G. Gopalakrishnan, and R. Tipireddy. 2020. "FPDetect: efficient reasoning about stencil programs using selective direct evaluation." ACM Transactions on Architecture and Code Optimization 17, no. 3:19. PNNL-SA-153397. doi:10.1145/3402451
  • Peng B., K. Kowalski, A.R. Panyala, and S. Krishnamoorthy. 2020. "Green’s function coupled cluster simulation of the near-valence ionizations of DNA-fragments." Journal of Chemical Physics 152, no. 1:Article No. 011101. PNNL-SA-148915. doi:10.1063/1.5138658
  • Sabet A.H., J. Qiu, Z. Zhao, and S. Krishnamoorthy. 2020. "Reliability Analysis for Unreliable FSM Computations." ACM Transactions on Architecture and Code Optimization 17, no. 2:Article No. 12. PNNL-SA-150357. doi:10.1145/3377456

2019

  • Bao W., S. Krishnamoorthy, L. Pouchet, and P. Sadayappan. 2019. "Analytical Modeling of Cache Behavior for Affine Programs." In Proceedings of the ACM on Programming Languages (POPL 2018), January 7-13, 2018, Los Angeles, CA, 2, Article No. 32. New York, New York:ACM. PNNL-SA-130215. doi:10.1145/3158120
  • Bauman N.P., E.J. Bylaska, S. Krishnamoorthy, G. Low, N.O. Wiebe, C.E. Granade, and M. Roetteler, et al. 2019. "Downfolding of many-body Hamiltonians using active-space models: extension of the sub-system embedding sub-algebras approach to unitary coupled cluster formalisms." Journal of Chemical Physics 151, no. 1:Article Number 014107. PNNL-SA-141041. doi:10.1063/1.5094643
  • Fang B., H. Halawa, K. Pattabiram, M. Ripeanu, and S. Krishnamoorthy. 2019. "BonVoision: Leveraging Spatial Data Smoothness For Recovery From Memory Soft Errors." In Proceedings of the ACM International Conference on Supercomputing (ICS 2019), June 26-28, 2019, Phoenix, AZ, 484-496. New York, New York:ACM. PNNL-SA-143140. doi:10.1145/3330345.3330388
  • Ghosh P., S. Krishnamoorthy, and A. Kalyanaraman. 2019. "PaKman: Scalable Assembly of Large Genomes on Distributed Memory Machines." In IEEE International Parallel & Distributed Processing Symposium (IPDPS 2019), May 20-24, 2019, Rio de Janeiro, Brazil, 578-589. Piscataway, New Jersey:IEEE. PNNL-SA-138919. doi:10.1109/IPDPS.2019.00067
  • Joardar B.K., P. Ghosh, P.P. Pande, A. Kalyanaraman, and S. Krishnamoorthy. 2019. "NoC-enabled Software/Hardware Co-Design Framework for Accelerating k-mer Counting." In Proceedings of the 13th IEEE/ACM International Symposium on Networks-on-Chip (NOCS 2019), October 17-18, 2019, New York, Article No. 4. New York, New York:ACM. PNNL-SA-147489. doi:10.1145/3313231.3352367
  • Khan M.H., M. Halappanavar, T.J. Hagge, K. Kowalski, A. Pothen, and S. Krishnamoorthy. 2019. "Mapping Arbitrarily Sparse Two-body Interactions on One-dimensional Quantum Circuits." In IEEE 26th International Conference on High Performance Computing, Data, and Analytics (HiPC 2019), December 17-20, Hyderabad, India, 52-62. Los Alamitos, California:IEEE Computer Society. PNNL-SA-144919. doi:10.1109/HiPC.2019.00018
  • Kumar S., L. Eyraud-Dubois, and S. Krishnamoorthy. 2019. "Performance Models for Data Transfers: A Case Study with Molecular Chemistry Kernels." In Proceedings of the 48th International Conference on Parallel Processing (ICPP 2019), August 5-8, 2019, Kyoto, Japan, Article No. 81. New York, New York:ACM. PNNL-SA-143840. doi:10.1145/3337821.3337921
  • Mutlu E., K. Kowalski, and S. Krishnamoorthy. 2019. "Toward Generalized Tensor Algebra for ab initio Quantum Chemistry Methods." In Proceedings of the 6th ACM SIGPLAN International Workshop on Libraries, Languages, and Compilers for Array Programming (ARRAY 2019), June 22, 2019, Phoenix, AZ, 46-56. New York, New York:ACM. PNNL-SA-142686. doi:10.1145/3315454.3329958
  • Nisa I., J. Li, A. Sukumaran-Rajan, P. Rawat, S. Krishnamoorthy, and P. Sadayappan. 2019. "An Efficient Mixed-Mode Representation of Sparse Tensors." In International Conference on High Performance Computing, Networking, Storage and Analysis, November 17-22, 2019, Denver, CO, Article No. a49. Los Alamitos, California:IEEE Computer Society. PNNL-SA-142737. doi:10.1145/3295500.3356216
  • Ren B., S. Balakrishna, Y. Jo, S. Krishnamoorthy, K. Agrawal, and M. Kulkarni. 2019. "Extracting SIMD Parallelism from Recursive Task-Parallel Programs." ACM Transactions on Parallel Computing 6, no. 4:Article No. 24. PNNL-SA-111445. doi:10.1145/3365663

2018

  • Hong C., A. Sukumaran-Rajan, J. Kim, P. Rawat, S. Krishnamoorthy, L. Pouchet, and F. Rastello, et al. 2018. "GPU Code Optimization Using Abstract Kernel Emulation and Sensitivity Analysis." ACM SIGPLAN Notices 53, no. 4:736-751. PNNL-SA-132802. doi:10.1145/3296979.3192397
  • Kestor G.G., B. Mutlu, J.B. Manzano Franco, O. Subasi, O. Unsal, and S. Krishnamoorthy. 2018. "Comparative Analysis of Soft-Error Detection Strategies: A Case Study with Iterative Methods." In Proceedings of the 15th ACM International Conference on Computing Frontiers (CF 2018), May 8-10, 2018, Ischia, Italy, 173-182. New York, New York:ACM. PNNL-SA-133097. doi:10.1145/3203217.3203240
  • Kestor G.G., I.B. Peng, R. Gioiosa, and S. Krishnamoorthy. 2018. "Understanding scale-dependent soft-error behavior of scientific applications." In Proceedings of the 18th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid 2018), May 1-4, 2018, Washington DC, 482-491. Piscataway, New Jersey:IEEE. PNNL-SA-132744. doi:10.1109/CCGRID.2018.00075
  • Kim J., A. Sukumaran-Rajan, C. Hong, A.R. Panyala, R. Srivastava, S. Krishnamoorthy, and P. Sadayappan. 2018. "Optimizing Tensor Contractions in CCSD(T) for Efficient Execution on GPUs." In Proceedings of the International Conference on Supercomputing (ICS 2018), June 12-15, 2018, Beijing, China, 96-106. New York, New York:ACM. PNNL-SA-134103. doi:10.1145/3205289.3205296
  • Mutlu B., G.G. Kestor, J.B. Manzano Franco, O. Unsal, S. Chatterjee, and S. Krishnamoorthy. 2018. "Characterization of the Impact of Soft Errors on Iterative Methods." In 25th IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC 2018), December 17-20, 2018, Bengaluru, India, 203-214. Los Alamitos, California:IEEE Computer Society. PNNL-SA-138072. doi:10.1109/HiPC.2018.00031
  • Mutlu E., A.R. Panyala, and S. Krishnamoorthy. 2018. "HPC Software Verification in Action: A Case Study with Tensor Transposition." In Second International Workshop on Software Correctness for HPC Applications (Correctness 2018), November 12, 2018, Dallas, TX, 9-16. Piscataway, New Jersey:IEEE. PNNL-SA-138083. doi:10.1109/Correctness.2018.00006
  • Roy P., S. Song, S. Krishnamoorthy, and X. Liu. 2018. "Lightweight Detection of Cache Conflicts." In Proceedings of the 2018 International Symposium on Code Generation and Optimization (CGO 2018), February 24-28, Vienna, Austria, 200-213. New York, New York:ACM. PNNL-SA-132742. doi:10.1145/3168819
  • Subasi O., and S. Krishnamoorthy. 2018. "On The Theory of Speculative Checkpointing: Time and Energy Considerations." In Proceedings of the 15th ACM International Conference on Computing Frontiers (CF 2018), May 8-10, 2018, Ischia, Italy, 165-172. New York, New York:ACM. PNNL-SA-132987. doi:10.1145/3203217.3203232
  • Subasi O., C. Chang, M. Erez, and S. Krishnamoorthy. 2018. "Characterizing the Impact of Soft Errors Affecting Floating-point ALUs using RTL-level Fault Injection." In Proceedings of the 47th International Conference on Parallel Processing, (ICPP 2018), August 13-16, 2018, Eugene, OR, Article No. 59. New York, New York:ACM. PNNL-SA-134868. doi:10.1145/3225058.3225089
  • Subasi O., R. Tipireddy, and S. Krishnamoorthy. 2018. "Quantification, Trade-off Analysis, and Optimal Checkpoint Placement for Reliability and Availability." In IEEE 25th International Conference on High Performance Computing (HiPC 2018), December 17-20, 2018, Bengaluru, India, 183-192. Los Alamitos, California:IEEE Computer Society. PNNL-SA-138205. doi:10.1109/HiPC.2018.00029
  • Subasi O., S. Di, L. Bautista-Gomez, P. Balaprakash, O. Unsal, J. Labarta, and A. Cristal, et al. 2018. "Exploring The Capabilities of Support Vector Machines in Detecting Silent Data Corruptions." Sustainable Computing: Informatics and Systems 19. PNNL-SA-131767. doi:10.1016/j.suscom.2018.01.004

2017

  • Kestor G., S. Krishnamoorthy, and W. Ma. 2017. "Localized Fault Recovery for Nested Fork-Join Programs." In Proceedings of the 31st IEEE International Parallel & Distributed Processing Symposium (IPDPS 2017), May 29-June 2, 2017, Orlando, Florida, 397-408. Los Alamitos, California:IEEE Computer Society. PNNL-SA-123481. doi:10.1109/IPDPS.2017.75
  • Lifflander J., and S. Krishnamoorthy. 2017. "Cache Locality Optimization for Recursive Programs." In Proceedings of the 38th ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI 2017), June 18-23, 2017, Barcelona, Spain, 1-16. New York, New York:ACM. PNNL-SA-123961. doi:10.1145/3140587.3062385
  • Panyala A.R., O. Subasi, M. Halappanavar, A. Kalyanaraman, D.G. Chavarria Miranda, and S. Krishnamoorthy. 2017. "Approximate Computing Techniques for Iterative Graph Algorithms." In IEEE 24th International Conference on High Performance Computing (HiPC 2017), December 18-21, 2017, Jaipur, India, 23-30. Los Alamitos, California:IEEE Computer Society. PNNL-SA-129904. doi:10.1109/HiPC.2017.00013
  • Rajbhandari S., F. Rastello, K. Kowalski, S. Krishnamoorthy, and P. Sadayappan. 2017. "Optimizing the Four-Index Integral Transform Using Data Movement Lower Bounds Analysis." In Proceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP 2017), February 4-8, 2017, Austin, Texas, 52, 327-340. New York, New York:ACM. PNNL-SA-122720. doi:10.1145/3155284.3018771
  • Ren B., S. Krishnamoorthy, K. Agrawal, and M. Kulkarni. 2017. "Exploiting Vector and Multicore Parallelism for Recursive, Data- and Task-Parallel Programs." In Proceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP 2017), February 4-8, 2017, Austin, Texas, 117-130. New York, New York:ACM. PNNL-SA-124366. doi:10.1145/3018743.3018763
  • Sharma V.C., G. Gopalakrishnan, and S. Krishnamoorthy. 2017. "PRESAGE: Protecting Structured Address Generation against Soft Errors." In IEEE 23rd International Conference on High Performance Computing (HiPC), December 19-22, 2016, Hyderabad, India, 252-261. Los Alamitos, California:IEEE Computer Society. PNNL-SA-121138. doi:10.1109/HiPC.2016.037
  • Subasi O., S. Di, P. Balaprakash, O. Unsal, J. Labarta, A. Cristal, and S. Krishnamoorthy, et al. 2017. "MACORD: Online Adaptive Machine Learning Framework for Silent Error Detection." In IEEE International Conference on Cluster Computing (CLUSTER 2017), September 5-8, 2017, Honolulu, HI, 717-724. Los Alamitos, California:IEEE Computer Society. PNNL-SA-128115. doi:10.1109/CLUSTER.2017.128

2016

  • Arafat M.H., J. Dinan, S. Krishnamoorthy, P. Balaji, and P. Sadayappan. 2016. "Work stealing for GPU-accelerated parallel programs in a global address space framework." Concurrency and Computation. Practice & Experience 28, no. 13:3637-3654. PNNL-SA-115979. doi:10.1002/cpe.3747
  • Bao W., S. Krishnamoorthy, L. Pouchet, F. Rastello, and P. Sadayappan. 2016. "PolyCheck: Dynamic Verification of Iteration Space Transformations on Affine Programs." In ACM Symposium on Principles of Programming Languages (POPL 2016), January 20-22, 2016, St. Petersburg, Florida, 539-554. New York, New York:ACM. PNNL-SA-113963. doi:10.1145/2837614.2837656
  • Daily J.A., A. Kalyanaraman, S. Krishnamoorthy, and B. Ren. 2016. "On the Impact of Widening Vector Registers on Sequence Alignment." In 45th International Conference on Parallel Processing (ICPP 2016), August 15-19, 2016, Philadelphia, Pennsylvania, 506-515. Piscataway, New Jersey:IEEE. PNNL-SA-118504. doi:10.1109/ICPP.2016.65
  • Hong C., W. Bao, A. Cohen, S. Krishnamoorthy, L. Pouchet, F. Rastello, and J. Ramanujam, et al. 2016. "Effective Padding of Multi-Dimensional Arrays to Avoid Cache Conflict Misses." In Proceedings of the 37th ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI 2016), June 13-17, 2016, Santa Barbara, California, 51, 129-144. New York, New York:ACM. PNNL-SA-115708. doi:10.1145/2908080.2908123
  • Rajbhandari S., J. Kim, S. Krishnamoorthy, L. Pouchet, F. Rastello, R.J. Harrison, and P. Sadayappan. 2016. "A domain-specific compiler for a parallel multiresolution adaptive numerical simulation environment." In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC 2016), November 13-18, 2016, Salt Lake City, Utah, 468-479. Piscataway, New Jersey:IEEE. PNNL-SA-126067. doi:10.1109/SC.2016.39
  • Rajbhandari S., J. Kim, S. Krishnamoorthy, L. Pouchet, F. Rastello, R.J. Harrison, and P. Sadayappan. 2016. "On Fusing Recursive Traversals of K-d Trees." In Proceedings of the 25th International Conference on Compiler Construction (CC 2016), March 12-18, 2016, Barcelona, Spain, 152-162. New York, New York:ACM. PNNL-SA-115709. doi:10.1145/2892208.2892228
  • Sharma V.C., G. Gopalakrishnan, and S. Krishnamoorthy. 2016. "PRESAGE: Protecting Structured Address Generation against Soft Errors." In IEEE 23rd International Conference on High Performance Computing (HiPC 2016), December 19-22, 2016, Hyderabad, India. Los Alamitos, California:IEEE Computer Society. PNNL-SA-124186. doi:10.1109/HiPC.2016.037
  • Sharma V.C., G. Gopalakrishnan, and S. Krishnamoorthy. 2016. "Towards Resiliency Evaluation of Vector Programs." In IEEE International Parallel and Distributed Processing Symposium Workshops, May 23-27, 2016, Chicago, Illinois. Piscataway, New Jersey:IEEE. PNNL-SA-117059. doi:10.1109/IPDPSW.2016.187
  • Tao D., S. Song, S. Krishnamoorthy, P. Wu, X. Liang, E. Zhang, and D.J. Kerbyson, et al. 2016. "New-Sum: A Novel Online ABFT Scheme For General Iterative Methods." In Proceedings of the 25th ACM international Symposium on High-Performance and Distributed Computing (HPDC 2016), May 31-June 4, 2016, Kyoto, Japan, 43-55. New York, New York:ACM. PNNL-SA-117061. doi:10.1145/2907294.2907306

2015

  • Aga S.D., S. Krishnamoorthy, and S. Narayanasamy. 2015. "CilkSpec: Optimistic Concurrency for Cilk." In Supercomputing (SC15): Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, November 15-20, 2015, Austin, Texas, Paper No. 83. New York, New York:Association for Computing Machinery (ACM). PNNL-SA-111827. doi:10.1145/2807591.2807597
  • Chavarría-Miranda D., A. Panyala, W. Ma, A. Prantl, and S. Krishnamoorthy. 2015. "Global Transformations for Legacy Parallel Applications via Structural Analysis and Rewriting." Parallel Computing 43, no. March 2015:1-26. PNNL-SA-104508. doi:10.1016/j.parco.2015.01.001
  • Chavarría-Miranda D., M. Halappanavar, S. Krishnamoorthy, J.B. Manzano Franco, A. Vishnu, and A. Hoisie. 2015. "On the Impact of Execution Models: A Case Study in Computational Chemistry." In Joint International Workshop on High-level Parallel Programming Models and Supportive Environments (HIPS) and Large-Scale Parallel Processing (LSPP), held in conjunction with the 29th IEEE International Parallel & Distributed Processing Symposium Workshop (IPDPSW 2015), May 25-29, 2015, Hyderabad, India, 255-264. Piscataway, New Jersey:IEEE. PNNL-SA-108382. doi:10.1109/IPDPSW.2015.111
  • Daily J.A., A. Kalyanaraman, S. Krishnamoorthy, and A. Vishnu. 2015. "A Work Stealing Based Approach for Enabling Scalable Optimal Sequence Homology Detection." Journal of Parallel and Distributed Computing 79-80. PNNL-SA-103338. doi:10.1016/j.jpdc.2014.08.009
  • Kleese van Dam K., E.G. Stephan, B. Raju, I. Altintas, T.O. Elsethagen, and S. Krishnamoorthy. 2015. "Enabling Structured Exploration of Workflow Performance Variability in Extreme-Scale Environments." In 8th Workshop on Many-Task Computing on Clouds, Grids, and Supercomputers (MTAGS) 2015, November 15, 2015, Austin, Texas. Chicago, Illinois:Data-Intensive Distributed Systems Laboratory. PNNL-SA-120941.
  • Krishnamoorthy S., J.A. Daily, A. Vishnu, and B.J. Palmer. 2015. "Global Arrays." In Programming Models for Parallel Computing. 113-128. Cambridge, Massachusetts:MIT Press. PNNL-SA-111177.
  • Ren B., Y. Jo, S. Krishnamoorthy, K. Agrawal, and M. Kulkarni. 2015. "Efficient Execution of Recursive Programs on Commodity Vector Hardware." In Proceedings of the 36th Annual ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI 2015), June 13-17, 2015, Portland, Oregon, 509-520. New York, New York:ACM. PNNL-SA-107984. doi:10.1145/2737924.2738004

2014

  • Arafat M.H., S. Krishnamoorthy, and P. Sadayappan. 2014. "Checksumming strategies for data in volatile memories." In 43rd International Conference on Parallel Processing Workshops (ICPPW 2014), September 9-12, 2014, Minneapolis, Minnesota, 245-254. Piscataway, New Jersey:IEEE. PNNL-SA-103603. doi:10.1109/ICPPW.2014.41
  • Chavarría-Miranda D., J.B. Manzano Franco, S. Krishnamoorthy, A. Vishnu, K.J. Barker, and A. Hoisie. 2014. "SCaLeM: A Framework for Characterizing and Analyzing Execution Models." In Beowulf 2014: Proceedings of the 20 Years of Beowulf Workshop on Honor of Thomas Sterling's 65th Birthday, October 13-14, 2014, Annapolis, Maryland, 34-43. New York, New York:ACM. PNNL-SA-105049. doi:10.1145/2737909.2737910
  • Kurt M.C., S. Krishnamoorthy, K. Agrawal, and G. Agrawal. 2014. "Fault-tolerant dynamic task graph scheduling." In International Conference for High Performance Computing, Storage and Analysis (SC14), November 16-21, 2014, New Orleans, Louisiana, 719-730. Piscataway, New Jersey:IEEE. PNNL-SA-103739. doi:10.1109/SC.2014.64
  • Lifflander J., E. Meneses, H. Menon, P. Miller, S. Krishnamoorthy, and L. Kale. 2014. "Scalable Replay with Partial-Order Dependencies for Message-Logging Fault Tolerance." In IEEE International Conference on Cluster Computing (CLUSTER 2014), September 22-26, 2014, Madrid, Spain, 19-28. Piscataway, New Jersey:IEEE. PNNL-SA-103978. doi:10.1109/CLUSTER.2014.6968739
  • Lifflander J., S. Krishnamoorthy, and L. Kale. 2014. "Optimizing Data Locality for Fork/Join Programs Using Constrained Work Stealing." In International Conference for High Performance Computing, Storage and Analysis (SC14), November 16-21, 2014, New Orleans, Louisiana, 857-868. Piscataway, New Jersey:IEEE. PNNL-SA-103776. doi:10.1109/SC.2014.75
  • Rajbhandari S., A. Nikam, P. Lai, K. Stock, S. Krishnamoorthy, and P. Sadayappan. 2014. "A Communication-Optimal Framework for Contracting Distributed Tensors." In International Conference for High Performance Computing, Storage and Analysis (SC14), November 16-21, 2014, New Orleans, Louisiana, 375-386. Piscataway, New Jersey:IEEE. PNNL-SA-103670. doi:10.1109/SC.2014.36
  • Rajbhandari S., A. Nikam, P. Lai, K. Stock, S. Krishnamoorthy, and P. Sadayappan. 2014. "CAST: Contraction Algorithm for Symmetric Tensors." In 43rd International Conference on Parallel Processing (ICPP 2014), September 9-12, 2014, Minneapolis, Minnesota, 261-272. Piscataway, New Jersey:IEEE. PNNL-SA-103757. doi:10.1109/ICPP.2014.35
  • Snir M., R.W. Wisniewski, J.A. Abraham, S. Adve, S. Bagchi, P. Balaji, and J. Belak, et al. 2014. "Addressing failures in exascale computing." International Journal of High Performance Computing Applications 28, no. 2:129-173. PNNL-SA-101991. doi:10.1177/1094342014522573
  • Tavarageri S., S. Krishnamoorthy, and P. Sadayappan. 2014. "Compiler-Assisted Detection of Transient Memory Errors." In Proceedings of the 35th ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI'14), June 9-11, 2014, Edinburgh, UK, 204-215. New York, New York:Association for Computing Machinery. PNNL-SA-100876. doi:10.1145/2666356.2594298

2013

  • Ali N., S. Krishnamoorthy, M. Halappanavar, and J.A. Daily. 2013. "Multi-fault Tolerance for Cartesian Data Distributions." International Journal of Parallel Programming 41, no. 3:469-493. PNNL-SA-87233. doi:10.1007/s10766-012-0218-5
  • Bhaskaran-Nair K., W. Ma, S. Krishnamoorthy, O. Villa, H.J. van Dam, E. Apra, and K. Kowalski. 2013. "Noniterative Multireference Coupled Cluster Methods on Heterogeneous CPU-GPU Systems." Journal of Chemical Theory and Computation 9, no. 4:1949-1957. PNNL-SA-92866.
  • Hermanns M., S. Krishnamoorthy, and F. Wolf. 2013. "A scalable infrastructure for the performance analysis of passive target synchronization." Parallel Computing 39, no. 3:132-145. PNNL-SA-91011. doi:10.1016/j.parco.2012.09.002
  • Huo X., S. Krishnamoorthy, and G. Agrawal. 2013. "Efficient Scheduling of Recursive Control Flow on GPUs." In Proceedings of the 27th international ACM Conference on International Conference on Supercomputing (ICS '13), June 10-14, 2013, Eugene, Oregon, 409-420. New York, New York:Association for Computing Machinery (ACM). PNNL-SA-95026. doi:10.1145/2464996.2479870
  • Lai P., K. Stock, S. Rajbhandari, S. Krishnamoorthy, and P. Sadayappan. 2013. "A Framework for Load Balancing of Tensor Contraction Expressions via Dynamic Task Partitioning." In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC'13), November 17-21, 2013, Paper No. 13. New York, New York:Association for Computing Machinery (ACM). PNNL-SA-97996. doi:10.1145/2503210.2503290
  • Lifflander J., S. Krishnamoorthy, and L. Kale. 2013. "Steal Tree: Low-Overhead Tracing of Work Stealing Schedulers." In Proceedings of the 34th ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI '13), June 16-19, 2013, Seattle, Washington, 507-518. New York, New York:Association for Computing Machinery (ACM). PNNL-SA-93954. doi:10.1145/2491956.2462193
  • Ma W., S. Krishnamoorthy, O. Villa, K. Kowalski, and G. Agrawal. 2013. "Optimizing Tensor Contraction Expressions for Hybrid CPU-GPU Execution." Cluster Computing 16, no. 1:131-155. PNNL-SA-79187. doi:10.1007/s10586-011-0179-2

2012

  • Arafat M.H., P. Sadayappan, J. Dinan, S. Krishnamoorthy, and T.L. Windus. 2012. "Load Balancing of Dynamical Nucleation Theory Monte Carlo Simulations through Resource Sharing Barriers." In IEEE 26th International Parallel & Distributed Processing Symposium (IPDPS 2012), May 21-25, 2012, Shanghai, 258-295. Piscataway, New Jersey:IEEE. PNNL-SA-86025. doi:10.1109/IPDPS.2012.35
  • Chavarría-Miranda D., S. Krishnamoorthy, and A. Vishnu. 2012. "Global Futures: a multithreaded execution model for Global Arrays-based applications." In Proceedings of the 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid 2012), May 13-16, 2012, Ottawa, Canada, 393-401. Piscataway, New Jersey:Institute of Electrical and Electronics Engineers. PNNL-SA-85689. doi:10.1109/CCGrid.2012.105
  • Daily J.A., S. Krishnamoorthy, and A. Kalyanaraman. 2012. "Towards Scalable Optimal Sequence Homology Detection." In 19th International Conference on High Performance Computing (HiPC), December 18-22, 2012, Pune, India. Piscataway, New Jersey:Institute of Electrical and Electronics Engineers. PNNL-SA-90521. doi:10.1109/HiPC.2012.6507523
  • Dinan J., P. Balaji, J.R. Hammond, S. Krishnamoorthy, and V. Tipparaju. 2012. "Supporting the Global Arrays PGAS Model Using MPI One-Sided Communication." In IEEE 26th International Parallel & Distributed Processing Symposium (IPDPS 2012), May 21-25, 2012, Shanghai, 739-750. Piscataway, New Jersey:IEEE. PNNL-SA-85960. doi:10.1109/IPDPS.2012.72
  • Gioiosa R., and S. Krishnamoorthy. 2012. "Operating System Support for Fine-Grained Task Migration." In Workshop on Exascale Operating Systems and Runtime Software, October 4-5, 2012, Washington DC. Lemont, Illinois:Argonne National Laboratory. PNNL-SA-90923.
  • Hammond J.R., S. Krishnamoorthy, S. Shende, N.A. Romero, and A.D. Malony. 2012. "Performance Characterization of Global Address Space Applications: A Case Study with NWChem." Concurrency and Computation. Practice & Experience 24, no. 2:135-154. PNNL-SA-79365. doi:10.1002/cpe.1881
  • Lifflander J., S. Krishnamoorthy, and L. Kale. 2012. "Work Stealing and Persistence-based Load Balancers for Iterative Overdecomposed Applications." In HPDC 2012: Proceedings of the 21st International Symposium on High-Performance Parallel and Distributed Computing, June 18-22, 2012, Delft, The Netherlands, 137-148. New York, New York:Association for Computing Machinery (ACM). PNNL-SA-86555. doi:10.1145/2287076.2287103
  • Lu Q., X. Gao, S. Krishnamoorthy, G. Baumgartner, J. Ramanujam, and P. Sadayappan. 2012. "Empirical Performance Model-Driven Data Layout Optimization and Library Call Selection for Tensor Contraction Expressions." Journal of Parallel and Distributed Computing 72, no. 3:338-362. PNNL-SA-79327. doi:10.1016/j.jpdc.2011.09.006
  • Ma W., and S. Krishnamoorthy. 2012. "Data-driven Fault Tolerance for Work Stealing Computations." In ICS 2012: Proceedings of the 26th ACM International Conference on Supercomputing, June 25-29, 2012, Venice, Italy, 79-90. New York, New York:Association for Computing Machinery (ACM). PNNL-SA-86484. doi:10.1145/2304576.2304589
  • Ma W., S. Krishnamoorthy, and G. Agrawal. 2012. "Parameterized Micro-benchmarking: An Auto-tuning Approach for Complex Applications." In CF 2012: Proceedings of the 9th Conference on Computing Frontiers, May 15-17, 2012, Cagliari, Italy, 213-222. New York, New York:Association for Computing Machinery (ACM). PNNL-SA-86263. doi:10.1145/2212908.2212938
  • Panyala A.R., D. Chavarría-Miranda, and S. Krishnamoorthy. 2012. "On the use of term rewriting for performance optimization of legacy HPC applications." In The 41st International Conference on Parallel Processing (ICPP), September 10-13, 2012, Pittsburgh, Pennsylvania, 399-409. Los Alamitos, California:IEEE Computer Society. PNNL-SA-88626. doi:10.1109/ICPP.2012.44

2011

  • Ali N., S. Krishnamoorthy, M. Halappanavar, and J.A. Daily. 2011. "Tolerating Correlated Failures for Generalized Cartesian Distributions via Bipartite Matching." In Proceedings of the 8th ACM International Conference on Computing Frontiers (CF 2011), May 3-5, 2011, Ischia, Italy. New York, New York:Association for Computing Machinery. PNNL-SA-76095. doi:10.1145/2016604.2016649
  • Ali N., S. Krishnamoorthy, N. Govind, and B.J. Palmer. 2011. "A Redundant Communication Approach to Scalable Fault Tolerance in PGAS Programming Models." In Proceedings of the 19th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP 2011), February 9-11, 2011, Ayia Napa, Cyprus, 24-31. Los Alamitos, California:IEEE Computer Society. PNNL-SA-75835.
  • Ali N., S. Krishnamoorthy, N. Govind, K. Kowalski, and P. Sadayappan. 2011. "Application-Specific Fault Tolerance via Data Access Characterization." In Proceedings of the 17th International European Conference on Parallel and Distributed Computing, (Euro-Par 2011), August 29-September 2, 2011, Bordeaux, France. Lecture Notes in Computer Science, edited by E Jeannot, R Namyst and J Roman, 6853, 340-352. Berlin:Springer-Verlag. PNNL-SA-79368. doi:10.1007/978-3-642-23397-5
  • Brabec J., S. Krishnamoorthy, H.J. van Dam, K. Kowalski, and J. Pittner. 2011. "Massively parallel implementation of the multi-reference Brillouin-Wigner CCSD method." Chemical Physics Letters 514, no. 4-6:347-351. PNNL-SA-80263. doi:10.1016/j.cplett.2011.08.016
  • Knapp R., K. Karavanic, S. Krishnamoorthy, and A. Marquez. 2011. "Power- and Cooling-Aware Parallel Performance Diagnosis." In Parallel and Distributed Computing and Systems (PDCS 2011), December 14-16, 2011, Dallas, Texas, Paper No. 757-114. Anaheim, California:ACTA Press. PNNL-SA-84331. doi:10.2316/P.2011.757-114
  • Kowalski K., R.M. Olson, S. Krishnamoorthy, V. Tipparaju, and E. Apra. 2011. "Role of Many-Body Effects in Describing Low-Lying Excited States of pi-Conjugated Chromophores: High-Level Equation-of-Motion Coupled-Cluster Studies of Fused Porphyrin Systems." Journal of Chemical Theory and Computation 7, no. 7:2200-2208. PNNL-SA-78691. doi:10.1021/ct200217y
  • Kowalski K., S. Krishnamoorthy, R.M. Olson, V. Tipparaju, and E. Apra. 2011. "Scalable implementations of accurate excited-state coupled cluster theories: application of high-level methods to porphyrin based systems." In Proceedings of International Conference for High Performance Computing, Networking, Storage and Analysis (SC'11), November 12-18, 2011, Seattle, Washington, Article No. 72. New York, New York:Association for Computing Machinery. PNNL-SA-78956. doi:10.1145/2063384.2063481
  • Ma W., S. Krishnamoorthy, and G. Agrawal. 2011. "Practical Loop Transformations for Tensor Contraction Expressions on Multi-Level Memory Hierarchies." In Compiler Construction: 20th International Conference, CC 2011, Held as Part of the Joint European Conferences on Theory and Practice of Software, ETAPS 2011, March 26 - April 3, 2011, Saarbrucken, Germany. Lecture Notes in Computer Science, edited by J Knoop, 6601, 266-285. Heidelberg:Springer-Verlag. PNNL-SA-77072. doi:10.1007/978-3-642-19861-8_15
  • Ma W., S. Krishnamoorthy, O. Villa, and K. Kowalski. 2011. "GPU-based implementations of the noniterative regularized-CCSD(T) corrections: applications to strongly correlated systems." Journal of Chemical Theory and Computation 7, no. 5:1316-1328. PNNL-SA-74991. doi:10.1021/ct1007247
  • Minnich R.G., C.L. Janssen, S. Krishnamoorthy, A. Marquez, M. Gokhale, P. Sadayappan, and E. Van Hensbergen, et al. 2011. "Fault Oblivious eXascale Whitepaper." In Proceedings of the 1st International Workshop on Runtime and Operating Systems for Supercomputers (ROSS 2011), held in conjunction with the 25th International Conference on Supercomputing, May 31, 2011, Tucson, Arizona, edited by K Iskra and T Hoefler, 17-24. New York, New York:Association for Computing Machinery. PNNL-SA-79579. doi:10.1145/1988796.1988800
  • Saraswat V.A., P. Kambadur, S. Kodali, D. Grove, and S. Krishnamoorthy. 2011. "Lifeline-based Global Load Balancing." In Proceedings of the 16th ACM Symposium on Principles and Practice of Parallel Programming (PPoPP 2011), February 12-16, 2011, San Antonio, TX, 201-211. New York, New York:Association for Computing Machinery. PNNL-SA-76960. doi:10.1145/1941553.1941582

2010

  • Chen L., O. Villa, S. Krishnamoorthy, and G.R. Gao. 2010. "Dynamic Load Balancing on Single- and Multi-GPU Systems." In Proceedings of the 24th IEEE International Symposium on Parallel & Distributed Processing (IPDPS 2010), 1-12. Piscataway, New Jersey:Institute of Electrical and Electronics Engineers. PNNL-SA-70333. doi:10.1109/IPDPS.2010.5470413
  • Dinan J.S., A. Singri, P. Sadayappan, and S. Krishnamoorthy. 2010. "Selective Recovery From Failures In A Task Parallel Programming Model." In Proceedings of the 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing (CCGrid 2010), 709-714. Piscataway, New Jersey:Institute of Electrical and Electronics Engineers. PNNL-SA-70950. doi:10.1109/CCGRID.2010.34
  • Glaesemann K.R., N. Govind, S. Krishnamoorthy, and K. Kowalski. 2010. "EOMCC, MRPT, and TDDFT Studies of Charge Transfer Processes in Mixed-Valence Compounds: Application to the Spiro Molecule." Journal of Physical Chemistry A 114, no. 33:8764-8771. PNNL-SA-71344. doi:10.1021/jp101761d
  • Kowalski K., S. Krishnamoorthy, O. Villa, J.R. Hammond, and N. Govind. 2010. "Active-space completely-renormalized equation-of-motion coupled-cluster formalism: Excited-state studies of green fluorescent protein, free-base porphyrin, and oligoporphyrin dimer." Journal of Chemical Physics 132, no. 15:Article Number: 154103. PNNL-SA-68983.
  • Krishnamoorthy S., and K. Agarwal. 2010. "Scalable Communication Trace Compression." In The 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing, 408-417. Piscataway, New Jersey:Institute of Electrical and Electronics Engineers. PNNL-SA-70735. doi:10.1109/CCGRID.2010.111
  • Ma W., S. Krishnamoorthy, O. Villa, and K. Kowalski. 2010. "Acceleration of Streamed Tensor Contraction Expressions on GPGPU-based Clusters." In Proceedings of the IEEE International Conference on Cluster Computing (CLUSTER 2010), 207-216. Piscataway, New Jersey:Institute of Electrical and Electronics Engineers. PNNL-SA-73012. doi:10.1109/CLUSTER.2010.26
  • Siegel J., O. Villa, S. Krishnamoorthy, A. Tumeo, and X. Li. 2010. "Efficient Sparse Matrix-Matrix Multiplication on Heterogeneous High Performance Systems." In Proceedings of the IEEE International Conference on Cluster Computing Workshops and Posters (CLUSTER WORKSHOPS 2010), 1-8. Piscataway, New Jersey:Institute of Electrical and Electronics Engineers. PNNL-SA-74056. doi:10.1109/CLUSTERWKSP.2010.5613109
  • Villa O., L. Chen, and S. Krishnamoorthy. 2010. "High Performance Molecular Dynamic Simulation on Single and Multi-GPU Systems." In IEEE International Symposium on Circuits and Systems (ISCAS 2010), 3805-3808. Piscataway, New Jersey:Institute of Electrical and Electronics Engineers. PNNL-SA-70332. doi:10.1109/ISCAS.2010.5537723

2009

  • Dinan J.S., D.B. Larkins, P. Sadayappan, S. Krishnamoorthy, and J. Nieplocha. 2009. "Scalable Work Stealing." In Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, Article No. 53. New York, New York:Association for Computing Machinery. PNNL-SA-67261. doi:10.1145/1654059.1654113
  • Hartono A., M.M. Baskaran, C. Bastoul, A. Cohen, S. Krishnamoorthy, B. Norris, and J. Ramanujam, et al. 2009. "Parametric Multi-Level Tiling of Imperfectly Nested Loops." In Proceedings of the 23rd International Conference on Supercomputing, 147-157. New York, New York:Association for Computing Machinery. PNNL-SA-65872. doi:10.1145/1542275.1542301
  • Hartono A., Q. Lu, T. Henretty, S. Krishnamoorthy, H. Zhang, G. Baumgartner, and D.E. Bernholdt, et al. 2009. "Performance Optimization of Tensor Contraction Expressions for Many Body Methods in Quantum Chemistry." Journal of Physical Chemistry A 113, no. 45:12715-12723. PNNL-SA-67482. doi:10.1021/jp9051215
  • Lu Q., C. Alias, U. Bondhugula, T. Henretty, S. Krishnamoorthy, J. Ramanujam, and A. Rountev, et al. 2009. "Data Layout Transformation for Enhancing Data Locality on NUCA Chip Multiprocessors." In The Eighteenth International Conference on Parallel Architectures and Compilation Techniques (PACT 2009). Los Alamitos, California:IEEE Computer Society. PNNL-SA-70540. doi:10.1109/PACT.2009.36
  • Villa O., D. Chavarría-Miranda, V. Gurumoorthi, A. Marquez, and S. Krishnamoorthy. 2009. "Effects of floating-point non-associativity on numerical computations on massively multithreaded systems." In Cray User Group Meeting (CUG 2009), May 4-7, 2009, Atlanta, Georgia. Corvallis, Oregon:Cray User Group, Inc. PNNL-SA-65946.
  • Villa O., S. Krishnamoorthy, J. Nieplocha, and D.M. Brown. 2009. "Scalable Transparent Checkpoint-Restart of Global Address Space Applications on Virtual Machines over Infiniband." In Proceedings of the 6th ACM Conference on Computing Frontiers, 197-206. New York, New York:Association for Computing Machinery. PNNL-SA-64617. doi:10.1145/1531743.1531776
  • Vydyanathan N., S. Krishnamoorthy, G.M. Sabin, U.V. Catalyurek, T. Kurc, P. Sadayappan, and J.H. Saltz. 2009. "An Integrated Approach to Locality-Conscious Processor Allocation and Scheduling of Mixed-Parallel Applications." IEEE Transactions on Parallel and Distributed Systems 20, no. 8:1158-1172. PNNL-SA-62411.

2008

  • Dinan J.S., S. Krishnamoorthy, D.B. Larkins, J. Nieplocha, and P. Sadayappan. 2008. "Scioto: A Framework for Global-View Task Parallelism." In 37th International Conference on Parallel Processing - ICPP '08, 586-593. Piscataway, New Jersey:IEEE. PNNL-SA-60689. doi:10.1109/ICPP.2008.44
  • Larkins D.B., J.S. Dinan, S. Krishnamoorthy, S. Parthasarathy, A. Rountev, and P. Sadayappan. 2008. "Global Trees: A Framework for Linked Data Structures on Distributed Memory Parallel Systems." In The International Conference for High Performance Computing, Networking, Storage, and Analysis. 2008. SC 2008. Piscataway, New Jersey:Institute of Electrical and Electronics Engineers (IEEE). PNNL-SA-60967. doi:10.1109/SC.2008.5218880
  • Nieplocha J., S. Krishnamoorthy, M. Valiev, M. Krishnan, B.J. Palmer, and P. Sadayappan. 2008. "Integrated Data and Task Management for Scientific Applications." In Proceedings of ICCS: Lecture Notes in Computer Science, 5101, 20-31. Berlin:Springer-Verlag. PNNL-SA-59937. doi:10.1007/978-3-540-69384-0_6

2007

  • Krishnamoorthy S., J. Piernas Canovas, V. Tipparaju, J. Nieplocha, and P. Sadayappan. 2007. "Non-collective Parallel I/O for Global Address Space Programming Models." In IEEE International Conference on Cluster Computing: IEEE CLUSTER'2007, 41-49. Los Alamitos, California:IEEE Computer Society. PNNL-SA-57215. doi:10.1109/CLUSTR.2007.4629215
  • Krishnamoorthy S., U. Catalyurek, J. Nieplocha, A. Rountev, and P. Sadayappan. 2007. "Hypergraph Partitioning for Automatic Memory Hierarchy Management." In Conference on High Performance Networking and Computing. Proceedings of the 2006 ACM/IEEE Conference on Supercomputing SC '06, Tampa, FL, 11-17 Nov. 2006, 12 pages. New York, New York:ACM Press. PNNL-SA-50740. doi:10.1109/SC.2006.36

2006

  • Blocksome M., C. Archer, T. Inglett, P. McCarthy, M. Mundy, J. Ratterman, and A. Sidelnik, et al. 2006. "Design and Implementation of a One-Sided Communication Interface for the IBM eServer Blue Gene." In Supercomputing 2006, ACM/IEEE Conference, 54. Los Alamitos, California:IEEE Computer Society. PNNL-SA-51284. doi:10.1109/SC.2006.22
  • Krishnamoorthy S., G. Baumgartner, C. Lam, J. Nieplocha, and P. Sadayappan. 2006. "Layout Transformation Support for the Disk Resident Arrays Framework." Journal of Supercomputing 36, no. 2:153-170. PNNL-SA-46908. doi:10.1007/s11227-006-7955-4
  • Krishnamoorthy S., U. Catalyurek, J. Nieplocha, and P. Sadayappan. 2006. "An Approach to Locality-Conscious Load Balancing and Transparent Memory Hierarchy Management with a Global-Address-Space Parallel Programming Model." In 20th International Parallel and Distributed Processing Symposium (IPDPS'06), 25-29 April 2006, 8. Piscataway, New Jersey:Institute of Electrical and Electronics Engineers. PNNL-SA-48518. doi:10.1109/IPDPS.2006.1639719

2005

  • Krishnamoorthy S., J. Nieplocha, and P. Sadayappan. 2005. "Data and Computation Abstractions for Dynamic and Irregular Computations." In Proceedings of the High Performance Computing-HiPC 2005. 12th International Conference. Published in Lecture Notes in Computer Science, 3769, 258-269. Berlin:Springer-Verlag. PNNL-SA-46884.