WanJuan: A Comprehensive Multimodal Dataset for Advancing English and Chinese Large Models.

Conghui He, Zhenjiang Jin, Chao Xu, Jiantao Qiu, Bin Wang, Wei Li, Hang Yan, Jiaqi Wang, Dahua Lin
Published in: CoRR (2023)
Keyphrases
  • probabilistic model
  • experimental data
  • language learning
  • multimedia
  • bayesian networks
  • statistical model
  • statistical models
  • text summarization
  • english language
  • multimodal interaction