Sameera Horawalavithana

Senior Data Scientist

Biography

Sameera Horawalavithana is a senior data scientist in Pacific Northwest National Laboratory’s (PNNL) Physical and Computational Science Directorate. He is a researcher and practitioner of large-scale artificial intelligence (AI) foundation models across multiple data modalities, such as language, and vision. His expertise on advancing AI for science, permitting and security problems has resulted in success in multiple DOE funded ($30M+) projects (e.g., PermitAI, Theseus, EXPERT, STEEL THREAD). The research that he led has highlighted in America's AI Action Plan (2025) and the White House Permitting Innovation Center. He has authored more than 40 peer reviewed papers, including publications at premier AI conferences and workshops. He also serves as an Associate Editor for the Institute of Electrical and Electronics Engineers (IEEE) Transactions on Artificial Intelligence journal, and an Area Chair for Empirical Methods in Natural Language Processing (EMNLP).

Research Interests

AI for science, permitting, and security
Natural language processing
Large multimodal models

Education

PhD in Computer Science and Engineering, University of South Florida
BS in Computer Science, University of Colombo

Affiliations and Professional Service

Funding

Expanding PermitAI Data and Application Solutions to Federal and State Level Environmental Permitting for Geothermal and Critical Minerals Projects in Alaska and Nevada - DOE Office of Critical Minerals and Energy Innovation (CMEI) (PI; FY 25 – FY27)
Theseus: A Computational Science Foundation Model - DOE, Office of Advanced Scientific Computing Research (PNNL PI, FY 25-FY27)
PermitAI: Democratizing AI for Improving Permitting Outcomes and Efficiency - DOE, Office of Policy (PI; FY23-FY26)
EXPERT 2.0: Reasoning about Global Proliferation Signatures with Evidence Tracing and Uncertainty Estimation (co-PI; FY23 $932K)

Editorial Board

Associated Editor, IEEE Transactions in Artificial Intelligence
Associated Editor, Nature PalComms, Humanities and Social Sciences Communications
Area Chair for Empirical Methods in Natural Language Processing (EMNLP)

Program Committee (Conferences)

Neural Information Processing Systems (NeurIPs)
The International Conference on Learning Representations (ICLR)
Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI)
The International Conference on Computational Linguistics (ACL)
The Conference on Empirical Methods in Natural Language Processing (EMNLP)
Conference on Language Modeling (COLM)
ACM Conference on AI and Agentic Systems (CAIS)

Awards and Recognitions

National recognition for the research work in America’s AI Action Plan (2025) and the White House Permitting Innovation Center
Best Paper Award (COVID Track), International Conference on Social Computing, Behavioral-Cultural Modeling, & Prediction and Behavior Representation in Modeling and Simulation (2021)
Best Computer Science Undergraduate Thesis award (2014)

Publications

2026

Horawalavithana, S., L. Phillips, I. Stewart, S. Munikoti, and K. Pazdernik. 2026. "Back to the Barn with LLAMAs: Evolving Pretrained LLM Backbones in Finetuning Vision Language Models." arXiv preprint arXiv:2604.10985.
Lilienthal, D., M. Mukherjee, and S. Horawalavithana. 2026. "Reward Design for Physical Reasoning in Vision-Language Models." arXiv preprint arXiv:2604.13993.
Munikoti, S., I. Stewart, S. Horawalavithana, H. Kvinge, T. Emerson, S. Thompson, and K. Pazdernik. 2026. "Generalist Multimodal AI: A Review of Architectures, Challenges and Opportunities." Neurocomputing, 132933.

2025

Chaturvedi, S., A. Acharya, R. Meyur, K. Hayashi, S. Munikoti, and S. Horawalavithana. 2025. "Evaluating the Robustness of Dense Retrievers in Interdisciplinary Domains." In KDD Workshop on Evaluation and Trustworthiness of Agentic and Generative AI Models.
Raab, R., M. Parker, D. Nally, S. Montgomery, A. Bernat, S. Munikoti, and S. Horawalavithana. 2025. "Audit, Alignment, and Optimization of LM-Powered Subroutines with Application to Public Comment Processing." arXiv preprint arXiv:2507.08109.
Wagle, S., S. Munikoti, R. Meyur, J. Whiting, H. Farr, A. Acharya, S. Horawalavithana, M. Halappanavar, J. Strube, and L. Fierce. 2025. "Leveraging Multimodal AI for Efficient Data Discovery in Wind Energy Research." In Practice and Experience in Advanced Research Computing 2025: The Power of Collaboration, 1–3.

2024

Horawalavithana, S., E. Ayton, A. Usenko, R. Cosbey, and S. Volkova. 2024. "Anticipating Technical Expertise and Capability Evolution in Research Communities Using Dynamic Graph Transformers." IEEE Transactions on Computational Social Systems 11(5):6982–7001.
Horawalavithana, S., S. Munikoti, I. Stewart, H. Kvinge, and K. Pazdernik. 2024. "SCITUNE: Aligning Large Language Models with Human-Curated Scientific Multimodal Instructions." In Proceedings of the 1st Workshop on NLP for Science (NLP4Science), The 2024 Conference on Empirical Methods in Natural Language Processing, 58–72.
Meyur, R., H. Phan, K. Hayashi, I. Stewart, S. Sharma, S. Chaturvedi, M. Parker, D. Nally, S. Montgomery, and K. Pazdernik. 2024. "Benchmarking LLMs for Environmental Review and Permitting." In Proceedings of the Workshop on Large Language Models for Scientific and Societal Advances (SciSoc LLM) at the 2025 ACM SIGKDD Conference on Knowledge Discovery and Data Mining.
Munikoti, S., A. Acharya, S. Wagle, and S. Horawalavithana. 2024. "Evaluating the Effectiveness of Retrieval-Augmented Large Language Models in Scientific Document Reasoning." In Proceedings of the Fourth Workshop on Scholarly Document Processing (SDP 2024), 84–89.
Stewart, I., S. Horawalavithana, B. Kennedy, S. Munikoti, and K. Pazdernik. 2024. "Surprisingly Fragile: Assessing and Addressing Prompt Instability in Multimodal Foundation Models." arXiv preprint arXiv:2408.14595.

2023

Acharya, A., S. Munikoti, A. Hellinger, S. Smith, S. Wagle, and S. Horawalavithana. 2023. "NuclearQA: A Human-Made Benchmark for Language Models for the Nuclear Domain." arXiv preprint arXiv:2310.10920.
Horawalavithana, S., R. De Silva, N. Weerasekara, N.K. Wai, M. Nabeel, B. Abayaratna, C. Elvitigala, P. Wijesekera, and A. Iamnitchi. 2023. "Vaccination Trials on Hold: Malicious and Low Credibility Content on Twitter during the AstraZeneca COVID-19 Vaccine Development." Computational and Mathematical Organization Theory 29(3):448–469.
Iamnitchi, A., L.O. Hall, S. Horawalavithana, F. Mubang, K.W. Ng, and J. Skvoretz. 2023. "Modeling Information Diffusion in Social Media: Data-Driven Observations." Frontiers in Big Data 6:1135191.
Munikoti, S., A. Acharya, S. Wagle, and S. Horawalavithana. 2023. "Atlantic: Structure-Aware Retrieval-Augmented Language Model for Interdisciplinary Science." In AI to Accelerate Science and Engineering, The Thirty-Eighth Annual AAAI Conference on Artificial Intelligence.
Wagle, S., S. Munikoti, A. Acharya, S. Smith, and S. Horawalavithana. 2023. "Empirical Evaluation of Uncertainty Quantification in Retrieval-Augmented Language Models for Science." In Scientific Document Understanding, The Thirty-Eighth Annual AAAI Conference on Artificial Intelligence.

2022

Botzer, N.A., Y.S. Horawalavithana, T. Weninger, and S. Volkova. 2022. "Lessons from Developing Multimodal Models with Code and Developer Interactions." In I Can't Believe It's Not Better Workshop: Understanding Deep Learning Through Empirical Falsification.
Horawalavithana, Y.S., E.M. Ayton, A.A. Usenko, S. Sharma, J. Eshun, R.J. Cosbey, and M.F. Glenski, et al. 2022. "EXPERT: Public Benchmarks for Dynamic Heterogeneous Academic Graphs." Presented by Y.S. Horawalavithana at Graph Learning Benchmarks Workshop, The Web Conference, Virtual, Florida. arXiv preprint arXiv:2204.07203
Horawalavithana, Y.S., E.M. Ayton, S. Sharma, S.A. Howland, M. Subramanian, S.W. Vasquez, and R.J. Cosbey, et al. 2022. "Foundation Models of Scientific Knowledge for Chemistry: Opportunities, Challenges and Lessons Learned." In Proceedings of BigScience Episode #5 -- Workshop on Challenges & Perspectives in Creating Large Language Models, May 2022, Virtual and Dublin, Ireland, 160–172. doi: 10.18653/v1/2022.bigscience-1.12
Horawalavithana, S., N. Choudhury, J. Skvoretz, et al. 2022. “Online discussion threads as conversation pools: predicting the growth of discussion threads on reddit.” Comput Math Organ Theory 28, 112–140 (2022). doi: 10.1007/s10588-021-09340-1
Horawalavithana, S., R. De Silva, N. Weerasekara, N.G. Kin Wai, M. Nabeel, B. Abayaratna, C. Elvitigala, P. Wijesekera, and A. Iamnitchi. 2022. “Vaccination trials on hold: malicious and low credibility content on Twitter during the AstraZeneca COVID-19 vaccine development.” Comput Math Organ Theory. 1-22. doi: 10.1007/s10588-022-09370-3
Kin Wai, N.G., S. Horawalavithana, and A.bIamnitchi. 2022. “Forecasting topic activity with exogenous and endogenous information signals in Twitter.” In Proceedings of the 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining. Association for Computing Machinery, New York, NY, USA, 95–98. doi: 10.1145/3487351.3488344
Ng, K.W., S. Horawalavithana, and A. Iamnitchi. 2022. “Social media activity forecasting with exogenous and endogenous signals.” Social Network Analysis and Mining 12, 102 doi: 10.1007/s13278-022-00927-3

2021

Horawalavithana, S., R. De Silva, M. Nabeel, C. Elvitigala, P. Wijesekara, and A. Iamnitchi. 2021. “Malicious and low credibility urls on twitter during the astrazeneca covid-19 vaccine development.” In Social, Cultural, and Behavioral Modeling: 14th International Conference, Virtual Event, July 6–9, 2021, Proceedings 14 (pp. 3-12). arXiv preprint arXiv:2102.12223
Horawalavithana, S., K.W. Ng, and A. Iamnitchi. 2021. “Drivers of Polarized Discussions on Twitter during Venezuela Political Crisis.” In 13th ACM Web Science Conference 2021 Association for Computing Machinery, New York, NY, USA, 205–214. doi: 10.1145/3447535.3462496
Kin Wai, N.G., S. Horawalavithana, and A. Iamnitchi. 2021. “Multi-platform information operations: Twitter, facebook and youtube against the white helmets.” In Proceedings of the 14th International AAAI Conference on Web and Social Media, Atlanta, USA.

2020

Horawalavithana, S., K.W. Ng, and A. Iamnitchi. 2020. “Twitter is the megaphone of cross-platform messaging on the white helmets.” In Social, Cultural, and Behavioral Modeling: 13th International Conference, Washington, DC, USA, October 18–21, 2020, Proceedings 13 (pp. 235-244). doi: 10.1007/978-3-030-61255-9_23

2019

Alhazmi, E., N. Choudhury, S. Horawalavithana, and A. Iamnitchi. 2019. “Temporal Mobility Networks in Online Gaming.” Frontiers in Big Data, 2, 21. doi: 10.3389/fdata.2019.00021
Horawalavithana, S., J. Arroyo Flores, J. Skvoretz, et al. 2019. “The risk of node re-identification in labeled social graphs.” Applied Network Science 4, 33 (2019). doi:10.1007/s41109-019-0148-x
Horawalavithana, S., J.G.A. Flores, J. Skvoretz, and A. Iamnitchi. 2019. "Behind the Mask: Understanding the Structural Forces That Make Social Graphs Vulnerable to Deanonymization," in IEEE Transactions on Computational Social Systems, vol. 6, no. 6, pp. 1343-1356. doi: 10.1109/TCSS.2019.2951330
Horawalavithana, S., A. Bhattacharjee, R. Liu, N. Choudhury, L.O. Hall, and A. Iamnitchi. 2019. “Mentions of Security Vulnerabilities on Reddit, Twitter and GitHub.” In IEEE/WIC/ACM International Conference on Web Intelligence (WI '19). Association for Computing Machinery, New York, NY, USA, 200–207. doi: 10.1145/3350546.3352519
Horawalavithana, S., C. Gandy, J.A. Flores, J. Skvoretz, and A. Iamnitchi. 2019. “Diversity, homophily and the risk of node re-identification in labeled social graphs.” Complex Networks and Their Applications VII. COMPLEX NETWORKS 2018. Studies in Computational Intelligence, vol. 813. Springer, Cham. doi: 10.1007/978-3-030-05414-4_32
Liu, R., F. Mubang, L. O. Hall, S. Horawalavithana, A. Iamnitchi, and J. Skvoretz. 2019. "Predicting Longitudinal User Activity at Fine Time Granularity in Online Collaborative Platforms." 2019 IEEE International Conference on Systems, Man and Cybernetics, Bari, Italy, pp. 2535-2542, doi: 10.1109/SMC.2019.8914586