Distributed Audio-Visual Parsing Based On Multimodal Transformer and Deep Joint Source Channel Coding.
Penghong Wang
Jiahui Li
Mengyao Ma
Xiaopeng Fan
Published in:
ICASSP (2022)
Keyphrases
audio visual
multi modal
multi stream
visual information
multimodal fusion
distributed systems
visual data
multimedia
peer to peer
image transmission
image classification
natural language processing
multiscale
error correction
source coding
image processing
computer vision