Login / Signup

AVLnet: Learning Audio-Visual Language Representations from Instructional Videos.

Andrew RouditchenkoAngie W. BoggustDavid HarwathBrian ChenDhiraj JoshiSamuel ThomasKartik AudhkhasiHilde KuehneRameswar PandaRogério Schmidt FerisBrian KingsburyMichael PichenyAntonio TorralbaJames R. Glass
Published in: Interspeech (2021)
Keyphrases
  • learning systems
  • learning process
  • learning algorithm
  • multimedia
  • online learning
  • metadata
  • learning environment
  • image classification
  • hybrid learning