Multi-Frame, Lightweight & Efficient Vision-Language Models for Question Answering in Autonomous Driving.
Akshay GopalkrishnanRoss GreerMohan M. TrivediPublished in: CoRR (2024)
Keyphrases
- lightweight
- question answering
- language model
- passage retrieval
- information retrieval
- multi frame
- language modeling
- sentence retrieval
- natural language processing
- document retrieval
- autonomous driving
- retrieval model
- speech recognition
- query expansion
- n gram
- information extraction
- computationally efficient
- probabilistic model
- test collection
- named entities
- wireless sensor networks
- computer vision
- vision system
- cross language
- query terms
- audio visual
- relevance model
- semantic roles
- point correspondences
- text retrieval
- retrieval effectiveness
- natural language
- image segmentation