SoMeLVLM: A Large Vision Language Model for Social Media Processing.
Xinnong ZhangHaoyu KuangXinyi MouHanjia LyuKun WuSiming ChenJiebo LuoXuanjing HuangZhongyu WeiPublished in: CoRR (2024)
Keyphrases
- language model
- media processing
- language modeling
- document retrieval
- n gram
- multimedia processing
- probabilistic model
- information retrieval
- mixture model
- computer vision
- query expansion
- test collection
- video processing
- retrieval model
- multimedia information
- real time
- smoothing methods
- content analysis
- document representation
- image processing
- ad hoc information retrieval
- recent advances
- multimedia
- modular design
- visual features
- co occurrence
- translation model
- metadata