UniKW-AT: Unified Keyword Spotting and Audio Tagging.
Heinrich Dinkel, Yongqing Wang, Zhiyong Yan, Junbo Zhang, Yujun Wang
Published in: CoRR (2022)
Keyphrases
- keyword spotting
- speech processing
- speech recognition
- signal processing
- speaker identification
- hidden Markov models
- printed documents
- metadata
- multimedia
- machine learning
- artificial intelligence
- visual information
- audio visual
- multimedia systems
- image processing
- digital libraries
- computer vision
- multimedia information
- information retrieval