December 14, 2016
Conference Paper

ROFI – The use of Repeated Optimization for Feature Interpretation

Abstract

High-throughput molecular profiling technologies have the capability to capture both global and targeted scale data for multiple biomolecules. In the case of biomarker discovery, building predictive models via machine learning has become a common strategy for evaluating these large complex datasets. In biology however, identification of a collection of features (e.g., metabolites) that work well in combination is vital to the identification of candidate biomarkers of interest for validation or to gain insight into the biological mechanisms associated with the outcome of interest. Thus, gaining insight into the value of the feature in the model is important for inferring relationships between the features and the phenotype. We present an approach that uses repeated optimization to enable improved feature interpretation (ROFI). ROFI is shown to return better or comparable results to existing methods for true negatives. Furthermore, application to a real world dataset associated with type 1 diabetes confirms that ROFI identifies metabolites with a clear connection to the disease in an easy to interpret format.

Revised: March 29, 2017 | Published: December 14, 2016

Citation

Webb-Robertson B.M., L.M. Bramer, S.M. Reehl, T.O. Metz, Q. Zhang, M. Rewers, and B. Frohnert. 2016. ROFI – The use of Repeated Optimization for Feature Interpretation. In International Conference on Computational Science and Computational Intelligence (CSCI 2016), December 15-17, 206, Las Vegas, Nevada, edited by HR Arabnia, L Deligiannidis and M Yang. Piscataway, New Jersey:IEEE. PNWD-SA-10276. doi:10.1109/CSCI.2016.0013