Fusing Vision and Language Models to Generate Sequence of Recipe Images from Steps.
Hshmat SahakPublished in: Tiny Papers @ ICLR (2024)
Keyphrases
- language model
- language modeling
- image features
- probabilistic model
- image data
- image classification
- speech recognition
- image understanding
- n gram
- information retrieval
- document retrieval
- retrieval model
- image database
- statistical language models
- computer vision
- query expansion
- test collection
- image collections
- document ranking
- language modelling
- image retrieval
- context sensitive
- image annotation
- relevance model
- language model for information retrieval