A Multimodal Approach for Percussion Music Transcription from Audio and Video.
Bernardo MarencoMagdalena FuentesFlorencia LanzaroMartín RocamoraAlvaro GómezPublished in: CIARP (2015)
Keyphrases
- audio features
- audio signals
- multimedia
- audio visual
- music retrieval
- audio content
- audio files
- digital audio
- story segmentation
- audio signal
- audio video
- music information retrieval
- digital video
- music genre classification
- multimodal fusion
- scene change detection
- multi modal
- visual data
- multimodal information
- music score
- video data
- broadcast news
- automatic transcription
- multimedia processing
- genre classification
- video files
- video sequences
- music scores
- automatic music genre classification
- video content
- feature set
- video streams
- news video
- video content analysis
- music collections
- visual speech
- digital music
- visual information
- audio recordings
- multimedia information
- low level
- signal processing
- multiple modalities
- closed captions
- acoustic features
- video frames
- real time
- video retrieval
- visual features
- audio visual content
- multimedia data
- multimedia content
- video recordings
- video clips
- video material
- cross modal
- online video
- soccer video
- lecture videos