Exploring Better Text Image Translation with Multimodal Codebook.
Zhibin Lan, Jiawei Yu, Xiang Li, Wen Zhang, Jian Luan, Bin Wang, Degen Huang, Jinsong Su
Published in: ACL (1) (2023)
Keyphrases
- image representation
- input image
- image classification
- image features
- image data
- image content
- multiscale
- single image
- web images
- image retrieval
- vector quantization
- low level
- image segmentation
- segmentation method
- image analysis
- vector quantized
- image regions
- hough transform
- text retrieval
- similarity measure
- information retrieval
- image collections
- image matching
- edge detection
- feature points
- machine translation
- high resolution
- pixel values
- text mining
- scanned documents
- multimodal image registration