Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification.

Published in: CoRR (2023)

Keyphrases