MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting.
Zhiqi AiZhiyong ChenShugong XuPublished in: CoRR (2024)
Keyphrases
- multi modal
- user defined
- keyword spotting
- speech recognition
- hidden markov models
- speech processing
- printed documents
- digital libraries
- language independent
- data types
- cross lingual
- handwritten documents
- image annotation
- query language
- cross language
- video search
- database
- high dimensional
- audio visual
- information retrieval