Sign in

Automatic Prosody Annotation with Pre-Trained Text-Speech Model.

Ziqian DaiJianwei YuYan WangNuo ChenYanyao BianGuangzhi LiDeng CaiDong Yu
Published in: INTERSPEECH (2022)
Keyphrases
  • probabilistic model
  • statistical model
  • real time
  • image sequences
  • training data
  • support vector
  • audio visual