Enhancing the Spatial Awareness Capability of Multi-Modal Large Language Model.
Yongqiang ZhaoZhenyu LiZhi JinFeng ZhangHaiyan ZhaoChengfeng DouZhengwei TaoXinhai XuDonghong LiuPublished in: CoRR (2023)
Keyphrases
- multi modal
- language model
- language modeling
- n gram
- probabilistic model
- document retrieval
- speech recognition
- language modelling
- query expansion
- information retrieval
- retrieval model
- multi modality
- context sensitive
- mixture model
- statistical language models
- high dimensional
- audio visual
- smoothing methods
- ad hoc information retrieval
- test collection
- image annotation
- video search
- word clouds
- query specific
- relevance model
- query processing
- feature extraction
- uni modal
- high level