A Framework for Vision-Language Warm-up Tasks in Multimodal Dialogue Models.
Jaewook Lee, Seongsik Park, Seong-Heum Park, Hongjin Kim, Harksoo Kim
Published in: EMNLP (2023)
Keyphrases
- probabilistic model
- modeling framework
- language learning
- mathematical framework
- computer vision
- statistical models
- main contribution
- natural language
- vision system
- image processing
- context dependent
- real time
- modelling language
- programming language
- prior knowledge
- complex systems
- bayesian framework
- knowledge base
- conceptual models