November 1, 2012
Journal Article

Speech information retrieval: a review

Abstract

Audio is an information-rich component of multimedia. Information can be extracted from audio in a number of different ways, and thus there are several established audio signal analysis research fields. These fields include speech recognition, speaker recognition, audio segmentation and classification, and audio fingerprinting. The information that can be extracted using the tools and methods developed in these fields can greatly enhance multimedia systems. In this paper, we present the current state of research in each of the major audio analysis fields. The goal is to introduce enough background for someone new to the field to quickly gain a high-level understanding and to provide direction for further study.

Revised: January 11, 2013 | Published: November 1, 2012

Citation

Hafen R.P., and M.J. Henry. 2012. Speech information retrieval: a review. Multimedia Systems 18, no. 6:499-518. PNNL-SA-77090. doi:10.1007/s00530-012-0266-0