Login / Signup

StyleCap: Automatic Speaking-Style Captioning from Speech Based on Speech and Language Self-supervised Learning Models.

Kazuki YamauchiYusuke IjimaYuki Saito
Published in: CoRR (2023)
Keyphrases
  • learning models
  • speech recognition
  • text to speech
  • learning algorithm
  • real world
  • natural language
  • machine learning
  • information extraction
  • semi supervised learning
  • loss function
  • machine learning models