eSTImate: A Real-time Speech Transmission Index Estimator With Speech Enhancement Auxiliary Task Using Self-Attention Feature Pyramid Network.
Bajian Xiang, Hongkun Liu, Zedong Wu, Su Shen, Xiangdong Zhang
Published in: INTERSPEECH (2023)
Keyphrases
- speech enhancement
- real time
- speech signal
- noisy environments
- noise reduction
- signal to noise ratio
- single channel
- linear prediction
- vocal tract
- multiscale
- multiresolution
- least squares
- speech recognition
- multi channel
- noisy speech
- feature vectors
- feature extraction
- audio visual
- background noise
- maximum likelihood
- input image
- smoothing algorithm