Low Latency Time Domain Multichannel Speech and Music Source Separation.
Gerald Schuller
Published in: CoRR (2022)
Keyphrases
- low latency
- source separation
- audio features
- single channel
- audio visual
- music information retrieval
- frequency domain
- low level
- music retrieval
- high speed
- visual features
- highly efficient
- high throughput
- feature set
- multi channel
- speaker identification
- real time
- virtual machine
- audio signal
- sound source
- multi modal
- stream processing
- independent component analysis
- high level
- image processing
- blind source separation
- speech signal
- visual data
- text data
- spatio temporal
- speech recognition
- mobile nodes
- temporal structure