RAG-Driver: Generalisable Driving Explanations with Retrieval-Augmented In-Context Learning in Multi-Modal Large Language Model.
Jianhao YuanShuyang SunDaniel OmeizaBo ZhaoPaul NewmanLars KunzeMatthew GaddPublished in: CoRR (2024)
Keyphrases
- multi modal
- language model
- retrieval model
- query expansion
- language modeling
- cross modal
- test collection
- information retrieval
- video search
- document retrieval
- probabilistic model
- statistical language models
- audio visual
- context sensitive
- language models for information retrieval
- query terms
- n gram
- multi modality
- ad hoc information retrieval
- speech recognition
- relevance model
- language modelling
- information retrieval systems
- active learning
- text retrieval
- retrieval systems
- mixture model
- image classification
- smoothing methods