Sign in

WhisperX: Time-Accurate Speech Transcription of Long-Form Audio.

Max BainJaesung HuhTengda HanAndrew Zisserman
Published in: CoRR (2023)
Keyphrases
  • multimedia
  • high accuracy
  • visual information
  • information systems
  • signal processing
  • data sets
  • search engine
  • website
  • case study
  • information retrieval systems
  • computationally efficient