Effects of out of vocabulary words in spoken document retrieval.
Philip C. Woodland, Sue E. Johnson, Pierre Jourlin, Karen Sparck Jones
Published in: SIGIR (2000)
Keyphrases
- spoken document retrieval
- out of vocabulary
- language model
- n gram
- word segmentation
- broadcast news
- named entity recognition
- cross language information retrieval
- hand crafted
- test collection
- information retrieval
- parallel corpora
- cross lingual
- cross language
- query terms
- machine translation
- language modeling
- language independent
- query translation
- term frequency
- named entities
- speech recognition
- natural language processing
- information extraction
- hidden Markov models