Exploring Few-Shot Fine-Tuning Strategies for Models of Visually Grounded Speech.

Tyler Miller, David Harwath
Published in: INTERSPEECH (2022)
Keyphrases
  • fine tuning
  • statistical models
  • search engine
  • viable alternative
  • fine tune
  • data sets
  • multi agent systems
  • multi modal
  • speech recognition
  • complex systems
  • speech signal