SimpleLLM4AD: An End-to-End Vision-Language Model with Graph Visual Question Answering for Autonomous Driving.
Peiru ZhengYun ZhaoZhan GongHong ZhuShaohua WuPublished in: CoRR (2024)
Keyphrases
- end to end
- language model
- question answering
- autonomous driving
- information retrieval
- passage retrieval
- language modeling
- document retrieval
- n gram
- retrieval model
- sentence retrieval
- natural language processing
- query expansion
- probabilistic model
- speech recognition
- information extraction
- translation model
- cross language
- test collection
- natural language
- named entities
- structured data
- computer vision
- query terms
- graph structure
- visual information
- vector space model
- pseudo relevance feedback
- relevance model
- video search
- document collections
- visual features
- vision system