Login / Signup
One Model, Multiple Modalities: A Sparsely Activated Approach for Text, Sound, Image, Video and Code.
Yong Dai
Duyu Tang
Liangxin Liu
Minghuan Tan
Cong Zhou
Jingquan Wang
Zhangyin Feng
Fan Zhang
Xueyu Hu
Shuming Shi
Published in:
CoRR (2022)
Keyphrases
</>
multiple modalities
input image
image segmentation
image content
high level
image analysis
image classification
image set
image data
video streams
image features
image retrieval
multi modal
spatial and temporal
web images
spatial context
multimedia