Creation of an Annotated German Broadcast Speech Database for Spoken Document Retrieval.
Stefan Eickeler, Martha A. Larson, Wolff Rüter, Joachim Köhler
Published in: LREC (2002)
Keyphrases
- spoken document retrieval
- cross language
- speech recognition errors
- broadcast news
- spoken documents
- information retrieval
- speech recognition
- test collection
- text retrieval
- question answering
- speech corpus
- spontaneous speech
- metadata
- out of vocabulary
- machine learning
- document collections
- speech synthesis
- language model
- speech signal
- cross language information retrieval
- multi modal
- document retrieval