Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models.

Yunfei Chu, Jin Xu, Xiaohuan Zhou, Qian Yang, Shiliang Zhang, Zhijie Yan, Chang Zhou, Jingren Zhou
Published in: CoRR (2023)
Keyphrases
  • language model
  • multimedia
  • visual information
  • language modeling
  • information retrieval
  • probabilistic model
  • audio visual
  • image retrieval
  • text mining
  • n gram
  • test collection
  • language modelling