Unifying Multimodal Transformer for Bi-directional Image and Text Generation.
Yupan HuangHongwei XueBei LiuYutong LuPublished in: ACM Multimedia (2021)
Keyphrases
- bi directional
- text generation
- image data
- image classification
- image features
- image analysis
- high resolution
- single image
- image content
- input image
- multiscale
- image representation
- image retrieval
- edge detection
- fuzzy logic
- image segmentation
- feature points
- segmentation method
- image regions
- natural language generation
- segmentation algorithm
- fault diagnosis