Score Images as a Modality: Enhancing Symbolic Music Understanding through Large-Scale Multimodal Pre-Training.
Yang QinHuiming XieShuxue DingYujie LiBenying TanMingchuan YePublished in: Sensors (2024)
Keyphrases
- image database
- multi modal
- image data
- three dimensional
- image features
- input image
- image registration
- image annotation
- object recognition
- test images
- ground truth
- edge detection
- image classification
- image collections
- image retrieval
- region of interest
- multiple images
- multimedia
- medical images
- image analysis
- multiscale
- computer vision
- supervised learning
- feature points
- multiresolution
- gray level
- training examples
- segmentation method
- training set
- similarity measure
- training process
- multimodal image registration