Relational Proxy Loss for Audio-Text based Keyword Spotting.
Youngmoon Jung, Seungjin Lee, Joon-Young Yang, Jaeyoung Roh, Chang Woo Han, Hoonyoung Cho
Published in: CoRR (2024)
Keyphrases
- keyword spotting
- speech processing
- multimedia
- speech recognition
- hidden Markov models
- signal processing
- speaker identification
- printed documents
- visual information
- visual features
- handwritten documents
- multimedia systems
- semantic information
- neural network
- feature vectors
- text classification
- audio features
- natural language processing
- broadcast news
- keywords
- Bayesian networks
- feature extraction
- artificial intelligence