Data Scientist, Biological Sciences
Sean Colby is an expert in applying computational resources to multidisciplinary problems, often involving massive datasets (big data). His work has focused primarily on “standards-free metabolomics,” which expands molecular reference libraries computationally rather than through cost- and time-prohibitive experimental acquisition. This research has yielded several open-source software packages: ISiCLE, the in silico chemical library engine, a quantum chemistry pipeline for molecular property prediction; DarkChem, a generative deep neural network that predicts molecular properties and generates potentially novel metabolites; and DEIMoS, or data extraction for integrated multidimensional spectrometry, which processes N-dimensional mass spectrometry data (e.g., LC-MS/MS, LC-IMS-MS/MS).

Research Interests

  • Metabolomics
  • Standards- and library-free compound identification
  • Cheminformatics, bioinformatics
  • Quantum chemistry
  • Artificial intelligence, machine learning
  • Computer vision
  • High-performance computing

Disciplines and Skills

  • Cheminformatics
  • Bioinformatics
  • Modeling and simulation
  • Machine learning
  • Deep learning
  • Computer vision

Education


  • BS, Bioengineering, Washington State University
  • MS, Computer Science, Georgia Institute of Technology



