Skip to Main Content U.S. Department of Energy
Fundamental and Computational Sciences Directorate

Staff information

Sriram

Sriram Krishnamoorthy

Scientist
Pacific Northwest National Laboratory
PO Box 999
MSIN: J4-30
Richland, WA 99352

Biography

Sriram Krishnamoorthy currently is a research scientist and the System Software and Applications Team Leader in PNNL’s High Performance Computing group, where he focuses on parallel programming models, fault tolerance, and compile-time/runtime optimizations for high-performance computing. He has more than 70 peer-reviewed conference and journal publications, receiving Best Paper awards for his publications at the International Conference on High Performance Computing (HiPC'03) and the International Parallel and Distributed Processing Symposium (IPDPS'04). In 2013, he received a U.S. Department of Energy Early Career award, granting him $2.5 million over five years to explore exascale computing advancements. That year, he also earned PNNL’s Ronald L. Brodzinski Award for Early Career Exceptional Achievement. In 2008, he received The Ohio State University’s Outstanding Researcher award. Sriram also is a senior member of the Institute of Electrical and Electronics Engineers. He earned his B.E. from the College of Engineering, Guindy (Chennai, India) and M.S. and Ph.D. degrees from The Ohio State University.

Research Interests

  • High-performance computing; Parallel programming models; Fault tolerance; Runtime systems; Loop transformations

Affiliations and Professional Service

  • IEEE senior member
  • ACM professional member

Awards and Recognitions

  • 2014 Best paper finalist, Supercomputing (SC)
  • 2014 Best student paper finalist, Supercomputing (SC)
  • 2014 Best paper nominee, IEEE Cluster
  • 2013 U.S. Department of Energy (DOE) Early Career award
  • 2013 Ronald L. Brodzinski Award for Early Career Exceptional Achievement award (PNNL)
  • 2013 Senior member, Institute of Electrical and Electronics Engineers (IEEE)
  • 2012 ASCR/ASC Exascale Operating Systems and Runtime (OS/R) Technical Council member
  • 2012 ASCR/ASC Resilience Technical Council member
  • 2008 Outstanding Researcher Award. Ohio State University (Computer Science and Engineering Department) 2007 SC’07 HPC Challenge Award - as part of IBM X10 team
  • 2007 IBM PhD Fellowship
  • 2007 Ohio State University Presidential Fellowship
  • 2006 Travel award, Supercomputing conference (SC)
  • 2004 Best paper award, International Parallel and Distributed Processing Symposium (IPDPS)
  • 2003 Best paper award, International Conference on High Performance Computing (HiPC)
  • 2003 Travel award, International Conference on Cluster Computing (CLUSTER)
  • 2001 Travel award, International Conference on High Performance Computing (HiPC)

PNNL Publications

2018

  • Seo S., A. Amer, P. Balaji, C. Bordage, G. Bosilca, A. Brooks, and P. Carns, et al. 2018. "Argobots: A Lightweight Low-Level Threading and Tasking Framework." IEEE Transactions on Parallel and Distributed Systems 29, no. 3:512-526. PNNL-SA-130264. doi:10.1109/TPDS.2017.2766062

2017

  • Kestor G., S. Krishnamoorthy, and W. Ma. 2017. "Localized Fault Recovery for Nested Fork-Join Programs." In Proceedings of the 31st IEEE International Parallel & Distributed Processing Symposium (IPDPS 2017), May 29-June 2, 2017, Orlando, Florida, 397-408. Los Alamitos, California:IEEE Computer Society. PNNL-SA-123481. doi:10.1109/IPDPS.2017.75
  • Lifflander J., and S. Krishnamoorthy. 2017. "Cache Locality Optimization for Recursive Programs." In Proceedings of the 38th ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI 2017), June 18-23, 2017, Barcelona, Spain, 1-16. New York, New York:ACM. PNNL-SA-123961. doi:10.1145/3140587.3062385
  • Panyala A.R., O. Subasi, M. Halappanavar, A. Kalyanaraman, D.G. Chavarria Miranda, and S. Krishnamoorthy. 2017. "Approximate Computing Techniques for Iterative Graph Algorithms." In IEEE 24th International Conference on High Performance Computing (HiPC 2017), December 18-21, 2017, Jaipur, India, 23 - 30. Los Alamitos, California:IEEE Computer Society. PNNL-SA-129904. doi:10.1109/HiPC.2017.00013
  • Rajbhandari S., F. Rastello, K. Kowalski, S. Krishnamoorthy, and P. Sadayappan. 2017. "Optimizing the Four-Index Integral Transform Using Data Movement Lower Bounds Analysis." In Proceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP 2017), February 4-8, 2017, Austin, Texas, 52, 327-340. New York, New York:ACM. PNNL-SA-122720. doi:10.1145/3155284.3018771
  • Ren B., S. Krishnamoorthy, K. Agrawal, and M. Kulkarni. 2017. "Exploiting Vector and Multicore Parallelsim for Recursive, Data- and Task-Parallel Programs." In Proceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PRoPP 2017), February 4-8, 2017, Austin, Texas, 117-130. New York, New York:ACM. PNNL-SA-124366. doi:10.1145/3018743.3018763
  • Sharma V.C., G. Gopalakrishnan, and S. Krishnamoorthy. 2017. "PRESAGE: Protecting Structured Address Generation against Soft Errors." In IEEE 23rd International Conference on High Performance Computing (HiPC), December 19-22, 2016, Hyderabad, India, 252-261. Los Alamitos, California:IEEE Computer Society. PNNL-SA-121138. doi:10.1109/HiPC.2016.037

2016

  • Arafat M.H., J. Dinan, S. Krishnamoorthy, P. Balaji, and P. Sadayappan. 2016. "Work stealing for GPU-accelerated parallel programs in a global address space framework." Concurrency and Computation. Practice & Experience 28, no. 13:3637-3654. PNNL-SA-115979. doi:10.1002/cpe.3747
  • Bao W., C. Hong, S. Chunduri, S. Krishnamoorthy, L. Pouchet, F. Rastello, and P. Sadayappan. 2016. "Static and dynamic frequency scaling on multicore CPUs." ACM Transactions on Architecture and Code Optimization 13, no. 4:51. PNNL-SA-121775. doi:10.1145/3011017
  • Bao W., S. Krishnamoorthy, L. Pouchet, F. Rastello, and P. Sadayappan. 2016. "PolyCheck: Dynamic Verification of Iteration Space Transformations on Affine Programs." In ACM Symposium on Principles of Programming Languages(POPL 2016), January 20-22, 2016, St. Petersburg, Florida, 539-554. New York, New York:ACM. PNNL-SA-113963. doi:10.1145/2837614.2837656
  • Daily J.A., A. Kalyanaraman, S. Krishnamoorthy, and B. Ren. 2016. "On the Impact of Widening Vector Registers on Sequence Alignment." In 45th International Conference on Parallel Processing (ICPP 2016), August 15-19, 2016, Philadelphia, Pennsylvania, 506 - 515. Piscataway, New Jersey:IEEE. PNNL-SA-118504. doi:10.1109/ICPP.2016.65
  • Hong C., W. Bao, A. Cohen, S. Krishnamoorthy, L. Pouchet, F. Rastello, and J. Ramanujam, et al. 2016. "Effective Padding of Multi-Dimensional Arrays to Avoid Cache Conflict Misses." In Proceedings of the 37th ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI 2016), June 13-17, 2016, Santa Barbara, California, 51, 129-144. New York, New York:ACM. PNNL-SA-115708. doi:10.1145/2908080.2908123
  • Kurt M., S. Krishnamoorthy, G. Agrawal, and B. Ren. 2016. "User-Assisted Store Recycling for Dynamic Task Graph Schedulers." ACM Transactions on Architecture and Code Optimization 13, no. 4:55. PNNL-SA-122158. doi:10.1145/3018111
  • Rajbhandari S., J. Kim, S. Krishnamoorthy, L. Pouchet, F. Rastello, R.J. Harrison, and P. Sadayappan. 2016. "A domain-specific compiler for a parallel multiresolution adaptive numerical simulation environment." In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC 2016), November 13-18, 2016, Salt Lake City, Utah, 468-479. Piscataway, New Jersey:IEEE. PNNL-SA-126067. doi:10.1109/SC.2016.39
  • Rajbhandari S., J. Kim, S. Krishnamoorthy, L. Pouchet, F. Rastello, R.J. Harrison, and P. Sadayappan. 2016. "On Fusing Recursive Traversals of K-d Trees." In Proceedings of the 25th International Conference on Compiler Construction (CC 2016), March 12-18, 2016, Barcelona, Spain, 152-162. New York, New York:ACM. PNNL-SA-115709. doi:10.1145/2892208.2892228
  • Sharma V.C., G. Gopalakrishnan, and S. Krishnamoorthy. 2016. "PRESAGE: Protecting Structured Address Generation against Soft Errors." In IEEE 23rd International Conference on High Performance Computing (HiPC 2016), December 19-22, 2016, Hyderabad, India. Los Alamitos, California:IEEE Computer Society. PNNL-SA-124186. doi:10.1109/HiPC.2016.037
  • Sharma V.C., G. Gopalakrishnan, and S. Krishnamoorthy. 2016. "Towards Resiliency Evaluation of Vector Programs." In IEEE International Parallel and Distributed Processing Symposium Workshops, May 23-27, 2016, Chicago,Illinois. Piscataway, New Jersey:IEEE. PNNL-SA-117059. doi:10.1109/IPDPSW.2016.187
  • Tao D., S. Song, S. Krishnamoorthy, P. Wu, X. Liang, E. Zhang, and D.J. Kerbyson, et al. 2016. "New-Sum: A Novel Online ABFT Scheme For General Iterative Methods." In Proceedings of the 25th ACM international Symposium on High-Performance and Distributed Computing (HPDC 2016), May 31-June 4, 2016, Kyoto, Japan, 43-55. New York, New York:ACM. PNNL-SA-117061. doi:10.1145/2907294.2907306

2015

  • Aga S.D., S. Krishnamoorthy, and S. Narayanasamy. 2015. "CilkSpec: Optimistic Concurrency for Cilk." In Supercomputing (SC15): Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, November 15-20, 2015, Austin, Texas, Paper No. 83. New York, New York:Association for Computing Machinery (ACM). PNNL-SA-111827. doi:10.1145/2807591.2807597
  • Chavarrķa-Miranda D., A. Panyala, W. Ma, A. Prantl, and S. Krishnamoorthy. 2015. "Global Transformations for Legacy Parallel Applications via Structural Analysis and Rewriting." Parallel Computing 43, no. March 2015:1-26. PNNL-SA-104508. doi:10.1016/j.parco.2015.01.001
  • Chavarrķa-Miranda D., M. Halappanavar, S. Krishnamoorthy, J.B. Manzano Franco, A. Vishnu, and A. Hoisie. 2015. "On the Impact of Execution Models: A Case Study in Computational Chemistry." In Joint International Workshop on High-level Parallel Programming Models and Supportive Environments (HIPS) and Large-Scale Parallel Processing (LSPP), held in conjunction with the 29th IEEE International Parallel & Distributed Processing Symposium Workshop (IPDPSW 2015), May 25-29, 2015, Hyderabad, India, 255-264. Piscataway, New Jersey:IEEE. PNNL-SA-108382. doi:10.1109/IPDPSW.2015.111
  • Daily J.A., A. Kalyanaraman, S. Krishnamoorthy, and A. Vishnu. 2015. "A Work Stealing Based Approach for Enabling Scalable Optimal Sequence Homology Detection." Journal of Parallel and Distributed Computing 79-80. PNNL-SA-103338. doi:10.1016/j.jpdc.2014.08.009
  • Kleese van Dam K., E.G. Stephan, B. Raju, I. Altintas, T.O. Elsethagen, and S. Krishnamoorthy. 2015. "Enabling Structured Exploration of Workflow Performance Variability in Extreme-Scale Environments." In 8th Workshop on Many-Task Computing on Clouds, Grids, and Supercomputers (MTAGS) 2015, November 15, 2015, Austin, Texas. Chicago, Illinois:Data-Intensive Distributed Systems Laboratory. PNNL-SA-120941.
  • Krishnamoorthy S., J.A. Daily, A. Vishnu, and B.J. Palmer. 2015. "Global Arrays." In Programming Models for Parallel Computing. 113-128. Cambridge, Massachusetts:MIT Press. PNNL-SA-111177.
  • Ren B., Y. Jo, S. Krishnamoorthy, K. Agrawal, and M. Kulkarni. 2015. "Efficient Execution of Recursive Programs on Commodity Vector Hardware." In Proceedings of the 36th Annual ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI 2015), June 13-17, 2015, Portland, Oregon, 509-520. New York, New York:ACM. PNNL-SA-107984. doi:10.1145/2737924.2738004

2014

  • Arafat M.H., S. Krishnamoorthy, and P. Sadayappan. 2014. "Checksumming strategies for data in volatile memories." In 43rd International Conference on Parallel Processing Workshops (ICCPW 2014), September 9-12, 2014, Minneapolis, Minnesota, 245-254. Piscataway, New Jersey:IEEE. PNNL-SA-103603. doi:10.1109/ICPPW.2014.41
  • Chavarrķa-Miranda D., J.B. Manzano Franco, S. Krishnamoorthy, A. Vishnu, K.J. Barker, and A. Hoisie. 2014. "SCaLeM: A Framework for Characterizing and Analyzing Execution Models." In Beowulf 2014: Proceedings of the 20 Years of Beowulf Workshop on Honor of Thomas Sterling's 65th Birthday, October 13-14, 2014, Annapolis, Maryland, 34-43. New York, New York:ACM. PNNL-SA-105049. doi:10.1145/2737909.2737910
  • Kurt M.C., S. Krishnamoorthy, K. Agrawal, and G. Agrawal. 2014. "Fault-tolerant dynamic task graph scheduling." In International Conference for High Performance Computing, Storage and Analysis (SC14), November 16-21, 2014, New Orleans, Louisiana, 719-730. Piscataway, New Jersey:IEEE. PNNL-SA-103739. doi:10.1109/SC.2014.64
  • Lifflander J., E. Meneses, H. Menon, P. Miller, S. Krishnamoorthy, and L. Kale. 2014. "Scalable Replay with Partial-Order Dependencies for Message-Logging Fault Tolerance." In IEEE International Conference on Cluster Computing (CLUSTER 2014), September 22-26, 2014, Madrid, Spain, 19-28. Piscataway, New Jersey:IEEE. PNNL-SA-103978. doi:10.1109/CLUSTER.2014.6968739
  • Lifflander J., S. Krishnamoorthy, and L. Kale. 2014. "Optimizing Data Locality for Fork/Join Programs Using Constrained Work Stealing." In International Conference for High Performance Computing, Storage and Analysis (SC14), November 16-21, 2014, New Orleans, Louisiana, 857-868. Piscataway, New Jersey:IEEE. PNNL-SA-103776. doi:10.1109/SC.2014.75
  • Rajbhandari S., A. NIkam, P. Lai, K. Stock, S. Krishnamoorthy, and P. Sadayappan. 2014. "A Communication-Optimal Framework for Contracting Distributed Tensors." In International Conference for High Performance Computing, Storage and Analysis (SC14), November 16-21, 2014, New Orleans, Louisiana, 375-386. Piscataway, New Jersey:IEEE. PNNL-SA-103670. doi:10.1109/SC.2014.36
  • Rajbhandari S., A. NIkam, P. Lai, K. Stock, S. Krishnamoorthy, and P. Sadayappan. 2014. "CAST: Contraction Algorithm for Symmetric Tensors." In 43rd International Conference on Parallel Processing (ICPP 2014), September 9-12, 2014, Minneapolis, Minnesota, 261-272. Piscataway, New Jersey:IEEE. PNNL-SA-103757. doi:10.1109/ICPP.2014.35
  • Snir M., R.W. Wisniewski, J.A. Abraham, S. Adve, S. Bagchi, P. Balaji, and J. Belak, et al. 2014. "Addressing failures in exascale computing." International Journal of High Performance Computing Applications 28, no. 2:129-173. PNNL-SA-101991. doi:10.1177/1094342014522573
  • Tavarageri S., S. Krishnamoorthy, and P. Sadayappan. 2014. "Compiler-Assisted Detection of Transient Memory Errors." In Proceedings of the 35th ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI'14), June 9-11, 2014, Edinburgh, UK, 204-215. New York, New York:Association for Computing Machinery. PNNL-SA-100876. doi:10.1145/2666356.2594298

2013

  • Ali N., S. Krishnamoorthy, M. Halappanavar, and J.A. Daily. 2013. "Multi-fault Tolerance for Cartesian Data Distributions." International Journal of Parallel Programming 41, no. 3:469-493. PNNL-SA-87233. doi:10.1007/s10766-012-0218-5
  • Bhaskaran-Nair K., W. Ma, S. Krishnamoorthy, O. Villa, H.J. van Dam, E. Apra, and K. Kowalski. 2013. "Noniterative Multireference Coupled Cluster Methods on Heterogeneous CPU-GPU Systems." Journal of Chemical Theory and Computation 9, no. 4:1949-1957. PNNL-SA-92866.
  • Huo X., S. Krishnamoorthy, and G. Agrawal. 2013. "Efficient Scheduling of Recursive Control Flow on GPUs." In Proceedings of the 27th international ACM Conference on International Conference on Supercomputing (ICS '13), June 10-14, 2013, Eugene, Oregon, 409-420. New York, New York:Association for Computing Machinery (ACM). PNNL-SA-95026. doi:10.1145/2464996.2479870
  • Lai P., K. Stock, S. Rajbhandari, S. Krishnamoorthy, and P. Sadayappan. 2013. "A Framework for Load Balancing of Tensor Contraction Expressions via Dynamic Task Partitioning." In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC'13), November 17-21, 2013, Paper No. 13. New York, New York:Association for Computing Machinery (ACM). PNNL-SA-97996. doi:10.1145/2503210.2503290
  • Lifflander J., S. Krishnamoorthy, and L. Kale. 2013. "Steal Tree: Low-Overhead Tracing of Work Stealing Schedulers." In Proceedings of the 34th ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI '13), June 16-19, 2013, Seattle, Washington, 507-518. New York, New York:Association for Computing Machinery (ACM). PNNL-SA-93954. doi:10.1145/2491956.2462193
  • Ma W., S. Krishnamoorthy, O. Villa, K. Kowalski, and G. Agrawal. 2013. "Optimizing Tensor Contraction Expressions for Hybrid CPU-GPU Execution." Cluster Computing 16, no. 1:131-155. PNNL-SA-79187. doi:10.1007/s10586-011-0179-2

2012

  • Arafat M.H., P. Sadayappan, J. Dinan, S. Krishnamoorthy, and T.L. Windus. 2012. "Load Balancing of Dynamical Nucleation Theory Monte Carlo Simulations through Resource Sharing Barriers." In IEEE 26th International Parallel & Distributed Processing Symposium (IPDPS 2012), May 21-25, 2012, Shanghai, 258-295. Piscataway, New Jersey:IEEE. PNNL-SA-86025. doi:10.1109/IPDPS.2012.35
  • Chavarrķa-Miranda D., S. Krishnamoorthy, and A. Vishnu. 2012. "Global Futures: a multithreaded execution model for Global Arrays-based applications." In Proceedings of the 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid 2012), May 13-16, 2012, Ottawa, Canada, 393-401. Piscataway, New Jersey:Institute of Electrical and Electronics Engineers. PNNL-SA-85689. doi:10.1109/CCGrid.2012.105
  • Daily J.A., S. Krishnamoorthy, and A. Kalyanaraman. 2012. "Towards Scalable Optimal Sequence Homology Detection." In 19th International Conference on High Performance Computing (HiPC), December 18-22, 2012, Pune, India. Piscataway, New Jersey:Institute of Electrical and Electronics Engineers. PNNL-SA-90521. doi:10.1109/HiPC.2012.6507523
  • Dinan J., P. Balaji, J.R. Hammond, S. Krishnamoorthy, and V. Tipparaju. 2012. "Supporting the Global Arrays PGAS Model Using MPI One-Sided Communication." In IEEE 26th International Parallel & Distributed Processing Symposium (IPDPS 2012), May 21-25, 2012, Shanghai, 739-750. Piscataway, New Jersey:IEEE. PNNL-SA-85960. doi:10.1109/IPDPS.2012.72
  • Gioiosa R., and S. Krishnamoorthy. 2012. "Operating System Support for Fine-Grained Task Migration." In Workshop on Exascale Operating Systems and Runtime Software, October 4-5, 2012, Washington DC. Lemont, Illinois:Argonne National Laboratory. PNNL-SA-90923.
  • Hammond J.R., S. Krishnamoorthy, S. Shende, N.A. Romero, and A.D. Malony. 2012. "Performance Characterization of Global Address Space Applications: A Case Study with NWChem." Concurrency and Computation. Practice & Experience 24, no. 2:135.154. PNNL-SA-79365. doi:10.1002/cpe.1881
  • Lifflander J., S. Krishnamoorthy, and L. Kale. 2012. "Work Stealing and Persistence-based Load Balancers for Iterative Overdecomposed Applications." In HPDC 2012: Proceedings of the 21st International Symposium on High-Performance Parallel and Distributed Computing, June 18-22, 2012, Delft, The Netherlands, 137-148. New York, New York:Association for Computing Machinery (ACM). PNNL-SA-86555. doi:10.1145/2287076.2287103
  • Lu Q., X. Gao, S. Krishnamoorthy, G. Baumgartner, J. Ramanujam, and P. Sadayappan. 2012. "Empirical Performance Model-Driven Data Layout Optimization and Library Call Selection for Tensor Contraction Expressions." Journal of Parallel and Distributed Computing 72, no. 3:338-362. PNNL-SA-79327. doi:10.1016/j.jpdc.2011.09.006
  • Ma W., and S. Krishnamoorthy. 2012. "Data-driven Fault Tolerance for Work Stealing Computations." In ICS 2012: Proceedings of the 26th ACM International Conference on Supercomputing, June 25-29, 2012, Venice, Italy, 79-90. New York, New York:Association for Computing Machinery (ACM). PNNL-SA-86484. doi:10.1145/2304576.2304589
  • Ma W., S. Krishnamoorthy, and G. Agrawal. 2012. "Parameterized Micro-benchmarking: An Auto-tuning Approach for Complex Applications." In CF 2012: Proceedings of the 9th Conference on Computing Frontiers, May 15-17, 2012, Cagliari, Italy, 213-222. New York, New York:Association for Computing Machinery (ACM). PNNL-SA-86263. doi:10.1145/2212908.2212938
  • Panyala A.R., D. Chavarrķa-Miranda, and S. Krishnamoorthy. 2012. "On the use of term rewriting for performance optimization of legacy HPC applications." In The 41st International Conference on Parallel Processing (ICPP), September 10-13, 2012, Pittsburgh, Pennsylvania, 399-409. Los Alamitos, California:IEEE Computer Society. PNNL-SA-88626. doi:10.1109/ICPP.2012.44

2011

  • Ali N., S. Krishnamoorthy, M. Halappanavar, and J.A. Daily. 2011. "Tolerating Correlated Failures for Generalized Cartesian Distributions via Bipartite Matching." In Proceedings of the 8th ACM International Conference on Computing Frontiers (CF 2011), May 3-5, 2011, Ischia, Italy. New York, New York:Association for Computing Machinery. PNNL-SA-76095. doi:10.1145/2016604.2016649
  • Ali N., S. Krishnamoorthy, N. Govind, and B.J. Palmer. 2011. "A Redundant Communication Approach to Scalable Fault Tolerance in PGAS Programming Models." In Proceedings of the19th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP 2011), February 9-11, 2011, Ayia Napa, Cyprus, 24-31. Los Alamitos, California:IEEE Computer Society. PNNL-SA-75835.
  • Ali N., S. Krishnamoorthy, N. Govind, K. Kowalski, and P. Sadayappan. 2011. "Application-Specific Fault Tolerance via Data Access Characterization." In Proceedings of the 17th International European Conference on Parallel and Distributed Computing, (Euro-Par 2011), August 29-September 2, 2011, Bordeaux, France. Lecture Notes in Computer Science, edited by E Jeannot, R Namyst and J Roman, 6853, 340-352. Berlin:Springer-Verlag. PNNL-SA-79368. doi:10.1007/978-3-642-23397-5
  • Brabec J., S. Krishnamoorthy, H.J. van Dam, K. Kowalski, and J. Pittner. 2011. "Massively parallel implementation of the multi-reference Brillouin-Wigner CCSD method." Chemical Physics Letters 514, no. 4-6:347-351. PNNL-SA-80263. doi:10.1016/j.cplett.2011.08.016
  • Knapp R., K. Karavanic, S. Krishnamoorthy, and A. Marquez. 2011. "Power- and Cooling-Aware Parallel Performance Diagnosis." In Parallel and Distributed Computing and Systems (PDCS 2011), December 14-16, 2011, Dallas, Texas, Paper No. 757-114. Anaheim, California:ACTA Press. PNNL-SA-84331. doi:10.2316/P.2011.757-114
  • Kowalski K., R.M. Olson, S. Krishnamoorthy, V. Tipparaju, and E. Apra. 2011. "Role of Many-Body Effects in Describing Low-Lying Excited States of pi-Conjugated Chromophores: High-Level Equation-of-Motion Coupled-Cluster Studies of Fused Porphyrin Systems." Journal of Chemical Theory and Computation 7, no. 7:2200-2208. PNNL-SA-78691. doi:10.1021/ct200217y
  • Kowalski K., S. Krishnamoorthy, R.M. Olson, V. Tipparaju, and E. Apra. 2011. "Scalable implementations of accurate excited-state coupled cluster theories: application of high-level methods to porphyrin based systems." In Proceedings of International Conference for High Performance Computing, Networking, Storage and Analysis (SC'11), November 12-18, 2011, Seattle, Washington, Article No. 72. New York, New York:Association for Computing Machinery. PNNL-SA-78956. doi:10.1145/2063384.2063481
  • Ma W., S. Krishnamoorthy, and G. Agrawal. 2011. "Practical Loop Transformations for Tensor Contraction Expressions on Multi-Level Memory Hierarchies." In Compiler Construction: 20th International Conference, CC 2011, Held as Part of the Joint European Conferences on Theory and Practice of Software, ETAPS 2011, March 26 - April 3, 2011, Saarbrucken, Germany. Lecture Notes in Computer Science, edited by J Knoop, 6601, 266-285. Heidelberg:Springer-Verlag. PNNL-SA-77072. doi:10.1007/978-3-642-19861-8_15
  • Ma W., S. Krishnamoorthy, O. Villa, and K. Kowalski. 2011. "GPU-based implementations of the noniterative regularized-CCSD(T) corrections: applications to strongly correlated systems." Journal of Chemical Theory and Computation 7, no. 5:1316-1328. PNNL-SA-74991. doi:10.1021/ct1007247
  • Minnich R.G., C.L. Janssen, S. Krishnamoorthy, A. Marquez, M. Gokhale, P. Sadayappan, and E. Van Hensbergen, et al. 2011. "Fault Oblivious eXascale Whitepaper." In Proceedings of the 1st International Workshop on Runtime and Operating Systems for Supercomputers (ROSS 2011), held in conjunction with the 25th International Conference on Supercomputing, May 31, 2011, Tucson, Arizona, edited by K Iskra and T Hoefler, 17-24. New York, New York:Association for Computing Machinery. PNNL-SA-79579. doi:10.1145/1988796.1988800
  • Saraswat V.A., P. Kambadur, S. Kodali, D. Grove, and S. Krishnamoorthy. 2011. "Lifeline-based Global Load Balancing." In Proceedings of the 16th ACM Symposium on Principles and Practice of Parallel Programming (PPoPP 2011), February 12-16, 2011, San Antonio, TX, 201-211. New York, New York:Association for Computing Machinery. PNNL-SA-76960. doi:10.1145/1941553.1941582

2010

  • Chen L., O. Villa, S. Krishnamoorthy, and G.R. Gao. 2010. "Dynamic Load Balancing on Single- and Multi-GPU Systems." In Proceedings of the 24th IEEE International Symposium on Parallel & Distributed Processing (IPDPS 2010), 1-12. Piscataway, New Jersey:Institute of Electrical and Electronics Engineers. PNNL-SA-70333. doi:10.1109/IPDPS.2010.5470413
  • Dinan J.S., A. Singri, P. Sadayappan, and S. Krishnamoorthy. 2010. "Selective Recovery From Failures In A Task Parallel Programming Model." In Proceedings of the the 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing (CCGrid 2010), 709-714. Piscataway, New Jersey:Institute of Electrical and Electronics Engineers. PNNL-SA-70950. doi:10.1109/CCGRID.2010.34
  • Glaesemann K.R., N. Govind, S. Krishnamoorthy, and K. Kowalski. 2010. "EOMCC, MRPT, and TDDFT Studies of Charge Transfer Processes in Mixed-Valence Compounds: Application to the Spiro Molecule." Journal of Physical Chemistry A 114, no. 33:8764-8771. PNNL-SA-71344. doi:10.1021/jp101761d
  • Kowalski K., S. Krishnamoorthy, O. Villa, J.R. Hammond, and N. Govind. 2010. "Active-space completely-renormalized equation-of-motion coupled-cluster formalism: Excited-state studies of green fluorescent protein, free-base porphyrin, and oligoporphyrin dimer." Journal of Chemical Physics 132, no. 15:Article Number: 154103. PNNL-SA-68983.
  • Krishnamoorthy S., and K. Agarwal. 2010. "Scalable Communication Trace Compression." In The 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing, 408-417. Piscataway, New Jersey:Institute of Electrical and Electronics Engineers. PNNL-SA-70735. doi:10.1109/CCGRID.2010.111
  • Ma W., S. Krishnamoorthy, O. Villa, and K. Kowalski. 2010. "Acceleration of Streamed Tensor Contraction Expressions on GPGPU-based Clusters." In Proceedings of the IEEE International Conference on Cluster Computing (CLUSTER 2010), 207-216. Piscataway, New Jersey:Institute of Electrical and Electronic Engineers. PNNL-SA-73012. doi:10.1109/CLUSTER.2010.26
  • Siegel J., O. Villa, S. Krishnamoorthy, A. Tumeo, and X. Li. 2010. "Efficient Sparse Matrix-Matrix Multiplication on Heterogeneous High Performance Systems." In Proceedings of the IEEE International Conference on Cluster Computing Workshops and Posters (CLUSTER WORKSHOPS 2010), 1-8. Piscataway, New Jersey:Institute of Electrical and Electronic Engineers. PNNL-SA-74056. doi:10.1109/CLUSTERWKSP.2010.5613109
  • Villa O., L. Chen, and S. Krishnamoorthy. 2010. "High Performance Molecular Dynamic Simulation on Single and Multi-GPU Systems." In IEEE International Symposium on Circuits and Systems (ISCAS 2010), 3805-3808. Piscataway, New Jersey:Institute of Electrical and Electronics Engineers. PNNL-SA-70332. doi:10.1109/ISCAS.2010.5537723

2009

  • Dinan J.S., D.B. Larkins, P. Sadayappan, S. Krishnamoorthy, and J. Nieplocha. 2009. "Scalable Work Stealing." In Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, Article No. 53. New York, New York:Association for Computing Machinery. PNNL-SA-67261. doi:10.1145/1654059.1654113
  • Hartono A., M.M. Baskaran, C. Bastoul, A. Cohen, S. Krishnamoorthy, B. Norris, and J. Ramanujam, et al. 2009. "Parametric Multi-Level Tiling of Imperfectly Nested Loops." In Proceedings of the 23rd International Conference on Supercomputing, 147-157. New York, New York:Association for Computing Machinery. PNNL-SA-65872. doi:10.1145/1542275.1542301
  • Hartono A., Q. Lu, t. henretty, S. Krishnamoorthy, h. zhang, G. Baumgartner, and D.E. Bernholdt, et al. 2009. "Performance Optimization of Tensor Contraction Expressions for Many Body Methods in Quantum Chemistry." Journal of Physical Chemistry A 113, no. 45:12715 - 12723. PNNL-SA-67482. doi:10.1021/jp9051215
  • Lu Q., C. Alias, U. Bondhugula, T. Henretty, S. Krishnamoorthy, J. Ramanujam, and A. Rountev, et al. 2009. "Data Layout Transformation for Enhancing Data Locality on NUCA Chip Multiprocessors." In The Eighteenth International Conference on Parallel Architectures and Compilation Techniques (PACT 2009). Los Alamitos, California:IEEE Computer Society. PNNL-SA-70540. doi:10.1109/PACT.2009.36
  • Villa O., D. Chavarrķa-Miranda, V. Gurumoorthi, A. Marquez, and S. Krishnamoorthy. 2009. "Effects of floating-point non-associativity on numerical computations on massively multithreaded systems." In Cray User Group Meeting (CUG 2009), May 4-7, 2009, Atlanta, Georgia. Corvallis, Oregon:Cray User Group, Inc. PNNL-SA-65946.
  • Villa O., S. Krishnamoorthy, J. Nieplocha, and D.M. Brown. 2009. "Scalable Transparent Checkpoint-Restart of Global Address Space Applications on Virtual Machines over Infiniband." In Proceedings of the 6th ACM Conference on Computing Frontiers, 197-206. New York, New York:Association for Computing Machinery. PNNL-SA-64617. doi:10.1145/1531743.1531776
  • Vydyanathan N., S. Krishnamoorthy, G.M. Sabin, U.V. Catalyurek, T. Kurc, P. Sadayappan, and J.H. Saltz. 2009. "An Integrated Approach to Locality-Conscious Processor Allocation and Scheduling of Mixed-Parallel Applications." IEEE Transactions on Parallel and Distributed Systems 20, no. 8:1158-1172. PNNL-SA-62411.

2008

  • Dinan J.S., S. Krishnamoorthy, D.B. Larkins, J. Nieplocha, and P. Sadayappan. 2008. "Scioto: A Framework for Global-ViewTask Parallelism." In 37th International Conference on Parallel Processing - ICPP '08, 586-593. Piscataway, New Jersey:IEEE. PNNL-SA-60689. doi:10.1109/ICPP.2008.44
  • Larkins D.B., J.S. Dinan, S. Krishnamoorthy, S. Parthasarathy, A. Rountev, and P. Sadayappan. 2008. "Global Trees: A Framework for Linked Data Structures on Distributed Memory Parallel Systems." In The International Conference for High Performance Computing, Networking, Storage, and Analysis. 2008. SC 2008.. Piscataway, New Jersey:Institute of Electrical and Electronics Engineers (IEEE). PNNL-SA-60967. doi:10.1109/SC.2008.5218880
  • Nieplocha J., S. Krishnamoorthy, M. Valiev, M. Krishnan, B.J. Palmer, and P. Sadayappan. 2008. "Integrated Data and Task Management for Scientific Applications." In Proceedings of ICCS: Lecture Notes in Computer Science, 5101, 20-31. Berlin:Springer-Verlag. PNNL-SA-59937. doi:10.1007/978-3-540-69384-0_6

2007

  • Krishnamoorthy S., J. Piernas Canovas, V. Tipparaju, J. Nieplocha, and P. Sadayappan. 2007. "Non-collective Parallel I/O for Global Address Space Programming Models." In IEEE International Conference on Cluster Computing: IEEE CLUSTER'2007, 41-49. Los Alamitos, New Mexico:IEEE Computer Society. PNNL-SA-57215. doi:10.1109/CLUSTR.2007.4629215
  • Krishnamoorthy S., U. Catalyurek, J. Nieplocha, A. Rountev, and P. Sadayappan. 2007. "Hypergraph Partitioning for Automatic Memory Hierarchy Management." In Conference on High Performance Networking and Computing. Proceedings of the 2006 ACM/IEEE Conference on Supercomputing SC '06, Tampa, FL, 11-17 Nov. 2006, 12 pages. New York, New York:ACM Press. PNNL-SA-50740. doi:10.1109/SC.2006.36

2006

  • Blocksome M., C. Archer, T. Inglett, P. McCarthy, M. Mundy, J. Ratterman, and A. Sidelnik, et al. 2006. "Design and Implementation of a One-Sided Communication Interface for the IBM eServer Blue Gene." In Supercomputing 2006, ACM/IEEE Conference, 54. Los Alamitos, California:IEEE Computer Society. PNNL-SA-51284. doi:10.1109/SC.2006.22
  • Krishnamoorthy S., G. Baumgartner, C. Lam, J. Nieplocha, and P. Sadayappan. 2006. "Layout Transformation Support for the Disk Resident Arrays Framework." Journal of Supercomputing 36, no. 2:153 - 170. PNNL-SA-46908. doi:10.1007/s11227-006-7955-4
  • Krishnamoorthy S., U. Catalyurek, J. Nieplocha, and P. Sadayappan. 2006. "An Approach to Locality-Conscious Load Balancing and Transparent Memory Hierarchy Management with a Global-Address-Space Parallel Programming Model." In 20th International Parallel and Distributed Symposium (IPDPS'06), 25-29 April 2006,, 8. Piscataway, New Jersey:Institute of Electrical and Electronics Engineers. PNNL-SA-48518. doi:10.1109/IPDPS.2006.1639719

2005

  • Krishnamoorthy S., J. Nieplocha, and P. Sadayappan. 2005. "Data and Computation Abstractions for Dynamic and Irregular Computations." In Proceedings of the High Performance Computing-HiPC 2005. 12th International Conference. Published in Lecture Notes in Computer Science, 3769, 258-269. Berlin:Springer Verlag. PNNL-SA-46884.

Science at PNNL

Core Research Areas

User Facilities

Centers & Institutes

Research Highlights

View All Research Highlights & Staff Accomplishments

RSS Feed

Contacts