Biography

Sumit Purohit is a data scientist at Pacific Northwest National Laboratory (PNNL) in the Physical and Computational Sciences Directorate. His research focuses on scalable graph analytics, knowledge graphs, and geometric deep learning while working on various Defense Advanced Research Projects Agency (DARPA) and Department of Energy projects in cybersecurity, national security, and social network domains. He is a key contributor on the PNNL DARPA Modeling Adversarial Activity where he developed novel capabilities to characterize and query real-world interactions using graph-based methods. He is principal investigator on internal laboratory-directed research and development projects that explore hybrid attack graphs for cyber physical systems such as microgrids to support resilience assessment experimentations. 

Research Interest

  • Graph Neural Networks
  • Machine Learning
  • Temporal Graph
  • Knowledge Graph
  • Ontology

Disciplines and Skills

  • Cybersecurity
  • Deep Learning
  • Graph Analytics
  • High Performance Computing (HPC)
  • Knowledge Representation

Education

PhD, Computer Science, Washington State University

MS, Computer Science, Northeastern University

BS, Information Technology, Jai Narain Vyas University

 

Affiliations and Professional Service

  • Association of Computing Machinery
  • Institute of Electrical and Electronics Engineers

Publications

2023

  • Subasi O., S. Purohit, A. Bhattacharya, and S. Chatterjee. 2023. "Impact-Driven Sampling Strategies for Hybrid Attack Graphs." In IEEE International Symposium on Technologies for Homeland Security (HST 2022) November 14-15, 2022, Virtual, Online, 1-7. Piscataway, New Jersey:IEEE. PNNL-SA-178629. doi:10.1109/HST56032.2022.10025439

2022

  • Das S., A. Dutta, S. Purohit, E. Serra, M. Halappanavar, and A. Pothen. 2022. "Towards Automatic Mapping of Vulnerabilities to Attack Patterns using Large Language Models." In IEEE International Symposium on Technologies for Homeland Security (HST 2022), November 14-15, 2022, Boston, MA, 1-7. Piscataway, New Jersey:IEEE. PNNL-SA-174298. doi:10.1109/HST56032.2022.10025459
  • Dutta A., S. Purohit, A. Bhattacharya, and O. Bel. 2022. "Cyber Attack Sequences Generation for Electric Power Grid." In 10th Workshop on Modeling and Simulation of Cyber-Physical Energy Systems (MSCPES 2022), May 3, 2022, Milan, Italy, 1-6. Piscataway, New Jersey:IEEE. PNNL-SA-170464. doi:10.1109/MSCPES55116.2022.9770105
  • Purohit S., G. Chin, L. Holder, and L. Holder. 2022. "ITeM: Independent Temporal Motifs to Summarize and Compare Temporal Networks." Intelligence Data Analysis 26, no. 4:1071 - 1096. PNNL-SA-158727. doi:10.3233/IDA-205698
  • Purohit S., N. Van, and G. Chin. 2022. "Semantic Property Graph for Scalable Knowledge Graph Analytics." In IEEE International Conference on Big Data (Big Data 2021), December 15-18, 2021, Orlando, FL, 2672-2677. Piscataway, New Jersey:IEEE. PNNL-SA-167338. doi:10.1109/BigData52589.2021.9671547
  • Purohit S., P.S. Mackey, W.P. Smith, M.P. Dunning, M.J. Orren, T.M. Langlie-Miletich, and R.D. Deshmukh, et al. 2022. "Transactional Knowledge Graph Generation To Model Adversarial Activities." In IEEE International Conference on Big Data (Big Data 2021), December 15-18, 2021, Orlando, FL, 2662-2671. Piscataway, New Jersey:IEEE. PNNL-SA-167380. doi:10.1109/BigData52589.2021.9672016

2021

  • Purohit S., P.S. Mackey, J.D. Zucker, A. Bohra, R.D. Deshmukh, and G. Chin. 2021. "QLiG: Query Like a Graph For Subgraph Matching." In IEEE Artificial Intelligence & Knowledge Engineering (AIKE 2021), December 1-3, 2021, Laguna Hills, CA, 121-128. Piscataway, New Jersey:IEEE. PNNL-SA-167142. doi:10.1109/AIKE52691.2021.00025

2020

  • Joaristi M., S. Purohit, R.D. Deshmukh, and G. Chin. 2020. "Data-Driven Template Discovery Using Graph Convolutional Neural Networks." In IEEE International Conference on Big Data (Big Data 2020), December 10-13, 2020, Atlanta, GA, 2534-2538. Piscataway, New Jersey:IEEE. PNNL-SA-156967. doi:10.1109/BigData50022.2020.9378318

2018

  • Choudhury S., S. Purohit, P. Lin, Y. Wu, L.B. Holder, and K. Agarwal. 2018. "Percolator: Scalable Pattern Discovery in Dynamic Graphs." In Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining (WSDM 2018), February 5-9, 2018, Los Angeles, California, 759-762. New York, New York:ACM. PNNL-SA-128916. doi:10.1145/3159652.3160589
  • Cottam J.A., S. Purohit, P.S. Mackey, and G. Chin. 2018. "Multi-Channel Large Network Simulation Including Adversarial Activity." In IEEE International Conference on Big Data (Big Data 2018), December 10-13, 2018, Seattle, WA, 3947-3950. Piscataway, New Jersey:IEEE. PNNL-SA-138688. doi:10.1109/BigData.2018.8622305
  • Purohit S., L. Holder, and G. Chin. 2018. "Temporal Graph Generation Based on a Distribution of Temporal Motifs." In 14TH INTERNATIONAL WORKSHOP ON MINING AND LEARNING WITH GRAPHS (MLG 2018), August 20, 2018, London, United Kingdom. PNNL-SA-134797.

2017

  • Choudhury S., K. Agarwal, S. Purohit, B. Zhang, M.A. Pirrung, W.P. Smith, and M. Thomas. 2017. "NOUS: Construction and Querying of Dynamic Knowledge Graphs." In IEEE 33rd International Conference on Data Engineering (ICDE 2017), April 19-22, 2017, San Diego, California, 1563-1565. Piscataway, New Jersey:IEEE. PNNL-SA-123812. doi:10.1109/ICDE.2017.228
  • Purohit S., S. Choudhury, and L.B. Holder. 2017. "Application-Specific Graph Sampling for Frequent Subgraph Mining and Community Detection." In IEEE International Conference on Big Data (Big Data 2017), December 11-14, 2017, Boston, Massachusetts, 1000-1005. Piscataway, New Jersey:IEEE. PNNL-SA-128679. doi:10.1109/BigData.2017.8258022
  • Visweswara Sathanur A., S. Choudhury, C.A. Joslyn, and S. Purohit. 2017. "When Labels Fall Short: Property Graph Simulation via Blending of Network Structure and Vertex Attributes." In ACMProceedings of the 2017 ACM on Conference on Information and Knowledge Management (CIKM 2017), November 6-10, 2017, Singapore, 2287-2290. New York, New York:ACM. PNNL-SA-126433. doi:10.1145/3132847.3133065

2016

  • Purohit S., P.R. Paulson, and L.R. Rodriguez. 2016. "User-Centric Approach for Benchmark RDF Data Generator in Big Data Performance Analysis." In 10th International Conference on Semantic Computing (ICSC 2016), Laguna Hills, California, 179-180. Piscataway, New Jersey:IEEE. PNNL-SA-114421. doi:10.1109/ICSC.2016.88
  • Purohit S., W.P. Smith, A.R. Chappell, P. West, B. Lee, E.G. Stephan, and P. Fox. 2016. "Effective Tooling for Linked Data Publishing in Scientific Research." In 10th IEEE International Conference on Semantic Computing (ICSC 2016), February 4-6, 2016, Laguna Hills, California, 24-31. Piscataway, New Jersey:IEEE. PNNL-SA-113974. doi:10.1109/ICSC.2016.87
  • Zhang B., S. Choudhury, M. Al-Hasan, X. Ning, K. Agarwal, S. Purohit, and P. Pesantez. 2016. "Trust from the past: Bayesian Personalized Ranking based Link Prediction in Knowledge Graphs." In Third Workshop on Mining Networks and Graphs: A Big Data Analytic Challenge (MNG 2016), May 7, 2016, Miami, Florida. Philadelphia, Pennsylvania:Society for Industrial and Applied Mathematics (SIAM). PNNL-SA-115550.

2015

  • Chappell A.R., J.R. Weaver, S. Purohit, W.P. Smith, K.L. Schuchardt, P. West, and B. Lee, et al. 2015. "Enhancing the Impact of Science Data: Toward Data Discovery and Reuse." In Proceedings of the IEEE/ACIS 14th International Conference on Computer and Information Science (ICIS) 2015, June 28-July 1, 2015, Las Vegas, Nevada, edited by T Ito, Y Kim and N Fukuta, 271-277. Piscataway, New Jersey:Institute of Electrical & Electronics Engineers (IEEE). PNNL-SA-107823. doi:10.1109/ICIS.2015.7166605
  • White S.K., S. Purohit, and L.W. Boyd. 2015. "Using GTO-Velo to Facilitate Communication and Sharing of Simulation Results in Support of the Geothermal Technologies Office Code Comparison Study." In Proceedings of the 40th Workshop on Geothermal Reservoir Engineering, January 26-28, 2015, Stanford, California, Paper No. SGP-TR-204. Stanford, California:Stanford University. PNNL-SA-107564.

2014

  • Freedman V.L., X. Chen, S.A. Finsterle, M.D. Freshley, I. Gorton, L.J. Gosink, and E. Keating, et al. 2014. "A high-performance workflow system for subsurface simulation." Environmental Modelling & Software 55. PNNL-SA-92680. doi:10.1016/j.envsoft.2014.01.030
  • Weaver J.R., V.G. Castellana, A. Morari, A. Tumeo, S. Purohit, A.R. Chappell, and D.J. Haglin, et al. 2014. "Toward a Data Scalable Solution for Facilitating Discovery of Science Resources." Parallel Computing 40, no. 10:682-696. PNNL-SA-101643. doi:10.1016/j.parco.2014.08.002

2013

  • Chappell A.R., S. Choudhury, J.T. Feo, D.J. Haglin, A. Morari, S. Purohit, and K.L. Schuchardt, et al. 2013. "Toward a Data Scalable Solution for Facilitating Discovery of Scientific Data Resources." In DISCS-2013: Proceedings of the International Workshop on Data-Intensive Scalable Computing Systems, November 18, 2013, Denver, CO, 55-60. New York, New York:Association for Computing Machinery. PNNL-SA-98169. doi:10.1145/2534645.2534655
  • Gorton I., J. Yin, B.A. Akyol, S. Ciraci, T. Critchlow, Y. Liu, and T.D. Gibson, et al. 2013. "GridOPTICS(TM) A Novel Software Framework for Integrating Power Grid Data Storage, Management and Analysis." In Proceedings of the 46th Hawaii International Conference on System Sciences (HICSS-46), January 7-10, 2013, Maui, Hawaii, edited by RH Sprague, Jr., 2167 -2176. Los Alamitos, California:IEEE Computer Society. PNNL-SA-88768. doi:10.1109/HICSS.2013.243
  • Scheibe T.D., M.D. White, S.K. White, C. Sivaramakrishnan, S. Purohit, G.D. Black, and R. Podgorney, et al. 2013. "Simulation of Enhanced Geothermal Systems: A Benchmarking and Code Intercomparison Study." In MODFLOW and More 2013: Translating Science into Practice, June 2-5, Golden, Colorado. Golden, Colorado:Integrated Ground Water Modeling Center. PNNL-SA-94774.
  • White S.K., L.J. Gosink, C. Sivaramakrishnan, G.D. Black, S. Purohit, D.H. Bacon, and Z. Hou, et al. 2013. "Implementations of a Flexible Framework for Managing Geologic Sequestration Modeling Projects." Energy Procedia 37. PNNL-SA-91339. doi:10.1016/j.egypro.2013.06.296

2012

  • Gorton I., C. Sivaramakrishnan, G.D. Black, S.K. White, S. Purohit, C.S. Lansing, and M.C. Madison, et al. 2012. "Velo: A Knowledge Management Framework for Modeling and Simulation." Computing in Science & Engineering 14, no. 2:12-23. PNNL-SA-81912. doi:10.1109/MCSE.2011.116
  • Schuchardt K.L., D.A. Agarwal, S.A. Finsterle, C.W. Gable, I. Gorton, L.J. Gosink, and E. Keating, et al. 2012. "AKUNA - INTEGRATED TOOLSETS SUPPORTING ADVANCED SUBSURFACE FLOW AND TRANSPORT SIMULATIONS FOR ENVIRONMETAL MANAGEMENT." In International Conference on Computational Methods in Water Resources (CMWR 2012), June 17-22, 2012, Champaign, IL. Washington, District Of Columbia:US Department of Energy, Office of Science. PNNL-SA-86251.

2011

  • Gorton I., C. Sivaramakrishnan, G.D. Black, S.K. White, S. Purohit, M.C. Madison, and K.L. Schuchardt. 2011. "Velo: Riding the Knowledge Management Wave for Simulation and Modeling." In 4th International Workshop on Software Engineering for Computational Science and Engineering (SECSE 2011), Co-located with the 33rd International Conference on Software Engineering (ICSE 2011) May 21-28, 2011, Honolulu, Hawaii, 32-40. New York, New York:Association for Computing Machinery. PNNL-SA-78215. doi:10.1145/1985782.1985788
  • Yin J., A.V. Kulkarni, S. Purohit, I. Gorton, and B.A. Akyol. 2011. "Scalable Real Time Data Management for Smart Grid." In Proceedings of the Middleware 2011 Industry Track, part of the 12th ACM/IFIP/USENIX International Middleware Conference, December 12-16, 2011, Lisbon, Portugal, Article No. 1. New York, New York:Association for Computing Machinery. PNNL-SA-83332. doi:10.1145/2090181.2090182