Sign in

TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion.

Samuel PeggKai LiXiaolin Hu
Published in: CoRR (2024)
Keyphrases
  • high level
  • probabilistic model
  • visual information