A Multimodal Approach to Device-Directed Speech Detection with Large Language Models.
Dominik WagnerAlexander W. ChurchillSiddharth SigtiaPanayiotis G. GeorgiouMatt MirsamadiAarshee MishraErik MarchiPublished in: CoRR (2024)
Keyphrases
- language model
- speech recognition
- word error rate
- language modeling
- n gram
- document retrieval
- information retrieval
- probabilistic model
- retrieval model
- spoken term detection
- language modelling
- automatic speech recognition
- ad hoc information retrieval
- test collection
- statistical language models
- query terms
- audio visual
- query expansion
- multimodal interfaces
- speech signal
- smoothing methods
- out of vocabulary
- context sensitive
- mixture model
- language models for information retrieval
- term dependencies
- document ranking
- pseudo relevance feedback
- multi modal
- broadcast news
- translation model
- cross lingual
- okapi bm
- hidden markov models