WAVPROMPT: Towards Few-Shot Spoken Language Understanding with Frozen Language Models.
Heting GaoJunrui NiKaizhi QianYang ZhangShiyu ChangMark Hasegawa-JohnsonPublished in: CoRR (2022)
Keyphrases
- language understanding
- language model
- natural language understanding
- language modeling
- document retrieval
- dialogue management
- speech recognition
- language modelling
- n gram
- language processing
- video shots
- information retrieval
- probabilistic model
- retrieval model
- test collection
- spoken dialogue systems
- key frames
- video data
- query expansion
- semantic interpretation
- statistical language models
- vector space model
- dialogue system
- general knowledge
- natural language
- video content
- query terms
- smoothing methods
- visual features
- news video
- cognitive psychology
- knowledge representation
- multimedia
- knowledge base
- feature selection
- artificial intelligence