AutoAD II: The Sequel - Who, When, and What in Movie Audio Description.

Tengda Han, Max Bain, Arsha Nagrani, Gül Varol, Weidi Xie, Andrew Zisserman
Published in: CoRR (2023)
Keyphrases
  • multimedia
  • shape description
  • visual information
  • digital audio
  • artificial intelligence
  • website
  • case study
  • high level
  • image sequences
  • video sequences
  • hidden markov models
  • speaker identification
  • audio signals