Login / Signup
A Two-Stage Audio-Visual Fusion Piano Transcription Model Based on the Attention Mechanism.
Yuqing Li
Xianke Wang
Ruimin Wu
Wei Xu
Wenqing Cheng
Published in:
IEEE ACM Trans. Audio Speech Lang. Process. (2024)
Keyphrases
</>
audio visual
attention mechanism
multimodal fusion
person authentication
multi modal
video summarization
visual attention model
visual attention
visual information
multi stream
multimedia
visual data
search engine
visual features
audio features