Pix2seq: A Language Modeling Framework for Object Detection.
Ting ChenSaurabh SaxenaLala LiDavid J. FleetGeoffrey E. HintonPublished in: CoRR (2021)
Keyphrases
- object detection
- language modeling framework
- language model
- query expansion
- language modeling
- document retrieval
- web search
- retrieval model
- computer vision
- topic modeling
- topic models
- object recognition
- relevance model
- smoothing methods
- information retrieval
- image understanding
- cross lingual
- low level
- retrieval effectiveness
- sentence retrieval
- probabilistic model
- multi label
- recommender systems
- multimedia
- learning algorithm