Prompting Large Language Models with Audio for General-Purpose Speech Summarization.
Wonjune KangDeb RoyPublished in: CoRR (2024)
Keyphrases
- language model
- general purpose
- speech recognition
- audio stream
- word error rate
- audio visual
- broadcast news
- language modeling
- soccer video
- automatic speech recognition
- n gram
- document retrieval
- probabilistic model
- speech signal
- language modelling
- query expansion
- audio features
- spoken term detection
- information retrieval
- multimedia
- visual data
- statistical language models
- context sensitive
- multi modal
- test collection
- multi document summarization
- okapi bm
- retrieval model
- ad hoc information retrieval
- document ranking
- error rate
- language model for information retrieval
- language models for information retrieval
- out of vocabulary
- video search
- text summarization
- vector space model
- query terms
- smoothing methods
- translation model
- pseudo relevance feedback