Statistical and Visual Analysis of Audio, Text, and Image Features for Multi-Modal Music Genre Recognition.
Ben WilkesIgor VatolkinHeinrich MüllerPublished in: Entropy (2021)
Keyphrases
- multi modal
- visual analysis
- automatic music genre classification
- audio visual
- image features
- audio content
- genre classification
- audio features
- cross modal
- music genre classification
- object recognition
- video search
- audio signals
- music information retrieval
- multiple modalities
- musical instruments
- multi modality
- audio signal
- single modality
- information visualization
- text data
- computer vision
- information retrieval
- image content
- feature extraction
- data analysis
- uni modal
- keywords
- open source
- high dimensional
- semantic concepts
- multimedia
- feature set
- image classification
- image representation
- information theoretic
- user interface
- broadcast news
- medical images
- image processing
- metadata