UGen: Multi-modal Music Understanding and Generation with the Power of Large Language Models.
Atin Sakkeer HussainShansong LiuChenshuo SunYing ShanPublished in: CoRR (2023)
Keyphrases
- multi modal
- language model
- language modeling
- n gram
- document retrieval
- language modelling
- information retrieval
- multi modality
- test collection
- query expansion
- speech recognition
- probabilistic model
- context sensitive
- retrieval model
- statistical language models
- high dimensional
- cross modal
- audio visual
- pseudo relevance feedback
- relevance model
- image annotation
- document ranking
- smoothing methods
- cross lingual
- translation model
- language models for information retrieval
- spoken term detection
- music information retrieval
- co occurrence
- feature vectors