Lexical Acquisition from Audio-Visual Streams Using a Multimodal Recurrent State-Space Model.

Published in: ICDL (2023)
