Late multimodal fusion for image and audio music transcription.
María Alfaro-Contreras, Jose J. Valero-Mas, José M. Iñesta, Jorge Calvo-Zaragoza
Published in: CoRR (2022)
Keyphrases
- multimodal fusion
- image data
- image classification
- image features
- image content
- image representation
- image retrieval
- image collections
- low level
- multimedia
- music information retrieval
- visual data
- post processing
- feature points
- audio visual
- audio features
- audio signals
- three dimensional
- high robustness
- data mining
- music score
- automatic transcription