Skip to Main Content U.S. Department of Energy
Fundamental and Computational Sciences Directorate

Staff information

Sumit Purohit

Data Scientist
Pacific Northwest National Laboratory
PO Box 999
MSIN: J4-30
Richland, WA 99352


Sumit serves as Computer Scientist for PNNL's Algorithm & Analysis team in Data Science group. He works on Knowledge Management Systems, Graph Mining, Semantic Web and Data Analysis. His recent research work in the field of knowledge graph and graph mining can be found here:

Research Interests

  • Knowledge Discovery
  • Graph Mining
  • Knowledge Management
  • Semantic Association
  • Ontology

Education and Credentials

  • MS, Computer Science, Northeastern University Boston, 2009
  • B.E., Information Technology, MBM Engineering College, Jodhpur, India, 2006

Affiliations and Professional Service

  • Member of Association of Computing Machinery
  • Member of IEEE

PNNL Publications


  • Choudhury S., S. Purohit, P. Lin, Y. Wu, L.B. Holder, and K. Agarwal. 2018. "Percolator: Scalable Pattern Discovery in Dynamic Graphs." In Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining (WSDM 2018), February 5-9, 2018, Los Angeles, California, 759-762. New York, New York:ACM. PNNL-SA-128916. doi:10.1145/3159652.3160589


  • Choudhury S., K. Agarwal, S. Purohit, B. Zhang, M.A. Pirrung, W.P. Smith, and M. Thomas. 2017. "NOUS: Construction and Querying of Dynamic Knowledge Graphs." In IEEE 33rd International Conference on Data Engineering (ICDE 2017), April 19-22, 2017, San Diego, California, 1563-1565. Piscataway, New Jersey:IEEE. PNNL-SA-123812. doi:10.1109/ICDE.2017.228
  • Purohit S., S. Choudhury, and L.B. Holder. 2017. "Application-Specific Graph Sampling for Frequent Subgraph Mining and Community Detection." In IEEE International Conference on Big Data (Big Data 2017), December 11-14, 2017, Boston, Massachusetts, 1000-1005. Piscataway, New Jersey:IEEE. PNNL-SA-128679. doi:10.1109/BigData.2017.8258022
  • Visweswara Sathanur A., S. Choudhury, C.A. Joslyn, and S. Purohit. 2017. "When Labels Fall Short: Property Graph Simulation via Blending of Network Structure and Vertex Attributes." In ACMProceedings of the 2017 ACM on Conference on Information and Knowledge Management (CIKM 2017), November 6-10, 2017, Singapore, 2287-2290. New York, New York:ACM. PNNL-SA-126433. doi:10.1145/3132847.3133065


  • Purohit S., P.R. Paulson, and L.R. Rodriguez. 2016. "User-Centric Approach for Benchmark RDF Data Generator in Big Data Performance Analysis." In 10th International Conference on Semantic Computing (ICSC 2016), Laguna Hills, California, 179-180. Piscataway, New Jersey:IEEE. PNNL-SA-114421. doi:10.1109/ICSC.2016.88
  • Purohit S., W.P. Smith, A.R. Chappell, P. West, B. Lee, E.G. Stephan, and P. Fox. 2016. "Effective Tooling for Linked Data Publishing in Scientific Research." In 10th IEEE International Conference on Semantic Computing (ICSC 2016), February 4-6, 2016, Laguna Hills, California, 24-31. Piscataway, New Jersey:IEEE. PNNL-SA-113974. doi:10.1109/ICSC.2016.87
  • Zhang B., S. Choudhury, M. Al-Hasan, X. Ning, K. Agarwal, S. Purohit, and P. Pesantez. 2016. "Trust from the past: Bayesian Personalized Ranking based Link Prediction in Knowledge Graphs." In Third Workshop on Mining Networks and Graphs: A Big Data Analytic Challenge (MNG 2016), May 7, 2016, Miami, Florida. Philadelphia, Pennsylvania:Society for Industrial and Applied Mathematics (SIAM). PNNL-SA-115550.


  • Chappell A.R., J.R. Weaver, S. Purohit, W.P. Smith, K.L. Schuchardt, P. West, and B. Lee, et al. 2015. "Enhancing the Impact of Science Data: Toward Data Discovery and Reuse." In Proceedings of the IEEE/ACIS 14th International Conference on Computer and Information Science (ICIS) 2015, June 28-July 1, 2015, Las Vegas, Nevada, edited by T Ito, Y Kim and N Fukuta, 271-277. Piscataway, New Jersey:Institute of Electrical & Electronics Engineers (IEEE). PNNL-SA-107823. doi:10.1109/ICIS.2015.7166605
  • Paulson P.R., S. Purohit, and L.R. Rodriguez. 2015. HPC Analytics Support: Requirements for Uncertainty Quantification Benchmarks. PNNL-24435. Richland, WA: Pacific Northwest National Laboratory.
  • White S.K., S. Purohit, and L.W. Boyd. 2015. "Using GTO-Velo to Facilitate Communication and Sharing of Simulation Results in Support of the Geothermal Technologies Office Code Comparison Study." In Proceedings of the 40th Workshop on Geothermal Reservoir Engineering, January 26-28, 2015, Stanford, California, Paper No. SGP-TR-204. Stanford, California:Stanford University. PNNL-SA-107564.


  • Freedman V.L., X. Chen, S.A. Finsterle, M.D. Freshley, I. Gorton, L.J. Gosink, and E. Keating, et al. 2014. "A high-performance workflow system for subsurface simulation." Environmental Modelling & Software 55. PNNL-SA-92680. doi:10.1016/j.envsoft.2014.01.030
  • Weaver J.R., V.G. Castellana, A. Morari, A. Tumeo, S. Purohit, A.R. Chappell, and D.J. Haglin, et al. 2014. "Toward a Data Scalable Solution for Facilitating Discovery of Science Resources." Parallel Computing 40, no. 10:682-696. PNNL-SA-101643. doi:10.1016/j.parco.2014.08.002


  • Chappell A.R., S. Choudhury, J.T. Feo, D.J. Haglin, A. Morari, S. Purohit, and K.L. Schuchardt, et al. 2013. "Toward a Data Scalable Solution for Facilitating Discovery of Scientific Data Resources." In DISCS-2013: Proceedings of the International Workshop on Data-Intensive Scalable Computing Systems, November 18, 2013, Denver, CO, 55-60. New York, New York:Association for Computing Machinery. PNNL-SA-98169. doi:10.1145/2534645.2534655
  • Gorton I., J. Yin, B.A. Akyol, S. Ciraci, T. Critchlow, Y. Liu, and T.D. Gibson, et al. 2013. "GridOPTICS(TM) A Novel Software Framework for Integrating Power Grid Data Storage, Management and Analysis." In Proceedings of the 46th Hawaii International Conference on System Sciences (HICSS-46), January 7-10, 2013, Maui, Hawaii, edited by RH Sprague, Jr., 2167 -2176. Los Alamitos, California:IEEE Computer Society. PNNL-SA-88768. doi:10.1109/HICSS.2013.243
  • Scheibe T.D., M.D. White, S.K. White, C. Sivaramakrishnan, S. Purohit, G.D. Black, and R. Podgorney, et al. 2013. "Simulation of Enhanced Geothermal Systems: A Benchmarking and Code Intercomparison Study." In MODFLOW and More 2013: Translating Science into Practice, June 2-5, Golden, Colorado. Golden, Colorado:Integrated Ground Water Modeling Center. PNNL-SA-94774.
  • White S.K., L.J. Gosink, C. Sivaramakrishnan, G.D. Black, S. Purohit, D.H. Bacon, and Z. Hou, et al. 2013. "Implementations of a Flexible Framework for Managing Geologic Sequestration Modeling Projects." Energy Procedia 37. PNNL-SA-91339. doi:10.1016/j.egypro.2013.06.296


  • Gorton I., C. Sivaramakrishnan, G.D. Black, S.K. White, S. Purohit, C.S. Lansing, and M.C. Madison, et al. 2012. "Velo: A Knowledge Management Framework for Modeling and Simulation." Computing in Science & Engineering 14, no. 2:12-23. PNNL-SA-81912. doi:10.1109/MCSE.2011.116


  • Gorton I., C. Sivaramakrishnan, G.D. Black, S.K. White, S. Purohit, M.C. Madison, and K.L. Schuchardt. 2011. "Velo: Riding the Knowledge Management Wave for Simulation and Modeling." In 4th International Workshop on Software Engineering for Computational Science and Engineering (SECSE 2011), Co-located with the 33rd International Conference on Software Engineering (ICSE 2011) May 21-28, 2011, Honolulu, Hawaii, 32-40. New York, New York:Association for Computing Machinery. PNNL-SA-78215. doi:10.1145/1985782.1985788
  • Yin J., A.V. Kulkarni, S. Purohit, I. Gorton, and B.A. Akyol. 2011. "Scalable Real Time Data Management for Smart Grid." In Proceedings of the Middleware 2011 Industry Track, part of the 12th ACM/IFIP/USENIX International Middleware Conference, December 12-16, 2011, Lisbon, Portugal, Article No. 1. New York, New York:Association for Computing Machinery. PNNL-SA-83332. doi:10.1145/2090181.2090182

Science at PNNL

Core Research Areas

User Facilities

Centers & Institutes

Research Highlights

View All Research Highlights & Staff Accomplishments

RSS Feed