The Sjtu System For Multimodal Information Based Speech Processing Challenge 2021.
Wei WangXun GongYifei WuZhikai ZhouChenda LiWangyou ZhangBing HanYanmin QianPublished in: ICASSP (2022)
Keyphrases
- speech processing
- multimodal information
- signal processing
- speech recognition
- multimedia systems
- speaker identification
- natural language processing
- artificial intelligence
- visual data
- video data
- english text
- information extraction
- variable length
- image processing
- machine learning
- nearest neighbor
- gaussian mixture model
- natural language
- pattern recognition