Unified Keyword Spotting and Audio Tagging on Mobile Devices with Transformers.
Heinrich DinkelYongqing WangZhiyong YanJunbo ZhangYujun WangPublished in: ICASSP (2023)
Keyphrases
- keyword spotting
- mobile devices
- speech processing
- speech recognition
- signal processing
- hidden markov models
- speaker identification
- printed documents
- multimedia
- metadata
- audio visual
- visual information
- machine learning
- handwritten documents
- information retrieval
- natural language processing
- digital libraries
- feature selection
- english text
- artificial intelligence
- neural network