Login / Signup
The Right to Talk: An Audio-Visual Transformer Approach.
Thanh-Dat Truong
Chi Nhan Duong
The De Vu
Hoang Anh Pham
Bhiksha Raj
Ngan Le
Khoa Luu
Published in:
ICCV (2021)
Keyphrases
</>
audio visual
multi modal
visual information
visual data
emotion recognition
temporal context
video summarization
multi stream
person authentication
multimedia
audio visual content
multimodal fusion
text classification
three dimensional
knowledge base
audio visual speech recognition
data sets