Matching Latent Encoding for Audio-Text based Keyword Spotting.
Kumari NishuMinsik ChoDevang NaikPublished in: INTERSPEECH (2023)
Keyphrases
- keyword spotting
- speech processing
- multimedia
- speech recognition
- hidden markov models
- signal processing
- speaker identification
- pattern recognition
- natural language processing
- image matching
- audio visual
- variable length
- printed documents
- text mining
- semantic information
- music information retrieval
- handwritten documents
- video sequences
- artificial intelligence