MuLan: A Joint Embedding of Music Audio and Natural Language.
Qingqing HuangAren JansenJoonseok LeeRavi GantiJudith Yue LiDaniel P. W. EllisPublished in: CoRR (2022)
Keyphrases
- natural language
- audio signals
- music score
- music information retrieval
- audio features
- music scores
- audio recordings
- audio signal
- music genre classification
- audio files
- music retrieval
- automatic music genre classification
- speech music discrimination
- music collections
- audio content
- data embedding
- digital audio
- musical instruments
- multimedia
- human language
- natural language interface
- machine learning
- digital music
- polyphonic music
- content based music retrieval
- audio visual
- language processing
- genre classification
- low level
- computer music
- information extraction
- audio video
- natural language generation
- semantic analysis
- knowledge representation
- data hiding
- natural language processing
- multimedia information
- music composition
- signal processing
- machine translation
- visual data
- speaker identification