
Multimodal Data and Resource Efficient Device-Directed Speech Detection with Large Foundation Models.

Dominik Wagner, Alexander W. Churchill, Siddharth Sigtia, Panayiotis G. Georgiou, Matt Mirsamadi, Aarshee Mishra, Erik Marchi
Published in: CoRR (2023)
Keyphrases
  • multimodal data
  • pattern recognition
  • speech recognition
  • feature selection
  • query processing
  • nearest neighbor
  • image classification