Login / Signup

Multimodal Data and Resource Efficient Device-Directed Speech Detection with Large Foundation Models.

Dominik WagnerAlexander W. ChurchillSiddharth SigtiaPanayiotis G. GeorgiouMatt MirsamadiAarshee MishraErik Marchi
Published in: CoRR (2023)
Keyphrases
  • multimodal data
  • pattern recognition
  • speech recognition
  • feature selection
  • query processing
  • nearest neighbor
  • image classification