MuLan: A Joint Embedding of Music Audio and Natural Language.
Qingqing HuangAren JansenJoonseok LeeRavi GantiJudith Yue LiDaniel P. W. EllisPublished in: ISMIR (2022)
Keyphrases
- natural language
- audio signals
- music score
- music information retrieval
- audio files
- audio features
- music scores
- audio signal
- audio recordings
- music genre classification
- data embedding
- audio content
- automatic music genre classification
- music collections
- music retrieval
- human language
- content based music retrieval
- speech music discrimination
- digital audio
- natural language interface
- knowledge representation
- musical instruments
- multimedia
- polyphonic music
- digital music
- visual information
- computer music
- natural language processing
- vector space
- natural language generation
- genre classification
- machine learning
- language processing
- machine translation
- question answering
- hidden markov models
- audio visual
- signal processing
- nonlinear dimensionality reduction
- feature vectors
- artificial intelligence
- information retrieval
- feature set
- multi modal
- audio video
- visual data
- semantic analysis
- digital video