Audiovisual Speaker Separation with Full- and Sub-Band Modeling in the Time-Frequency Domain.
Vahid Ahmadi KalkhoraniAnurag KumarKe TanBuye XuDeLiang WangPublished in: ICASSP (2024)
Keyphrases
- subband
- wavelet transform
- frequency domain
- high frequency
- audio visual
- low frequency
- wavelet packet
- multiresolution
- wavelet coefficients
- wavelet decomposition
- image compression
- discrete wavelet transform
- filter bank
- feature vectors
- low pass
- wavelet domain
- bit rate
- signal processing
- frequency band
- high pass
- visual information
- spatial domain
- speech recognition
- image quality
- machine learning
- locally adaptive
- lifting scheme