SpeechFind
SpeechFind
The SpeechFind system [1][2] is a spoken document retrieval system currently serving as the search engine for the National Gallery of the Spoken Word (NGSW) [3] and Collaborative Digitization Program (CDP) [4]. The system constructed in two phases: i) enrollment and ii)query and retrieval. In the enrollment phase, large audio sets are submitted for audio segmentation and transcription generation and metadata construction. Once this phase is completed, the audio, transcription, and meta data are entered into an online depository, the audio material is then available through the online audio search engine for the query and retrieval phase.

[1] http://SpeechFind.utdallas.edu
[2] J. H. L. Hansen, R. Huang, B. Zhou, M. Seadle, J. R. Deller, Jr., A. R. Gurijala, M. Kurimo, and P. Angkititrakul, "SpeechFind: Advances in Spoken Document Retrieval for a National Gallery of the Spoken Word," IEEE Trans. on Speech and Audio Proc., vol. 13, no. 5, pp.712-730, Sep. 2005.
[4] http://www.cdpheritage.org
Participants: Wooil Kim
Center for Robust Speech Systems
CRSS Office :
972 883 4749
e-mail: Speech
Prof. John H.L. Hansen :
972 883 2910
e-mail: John H.L. Hansen
Updated: March 28, 2008