Login / Signup
Unified Keyword Spotting and Audio Tagging on Mobile Devices with Transformers.
Heinrich Dinkel
Yongqing Wang
Zhiyong Yan
Junbo Zhang
Yujun Wang
Published in:
CoRR (2023)
Keyphrases
</>
keyword spotting
mobile devices
speech processing
speech recognition
signal processing
hidden markov models
speaker identification
metadata
multimedia
printed documents
audio visual
multimedia systems
natural language processing
visual information
language model
handwritten documents
image analysis