Login / Signup

Style-transfer based Speech and Audio-visual Scene understanding for Robot Action Sequence Acquisition from Videos.

Chiori HoriPuyuan PengDavid HarwathXinyu LiuKei OtaSiddarth JainRadu CorcodelDevesh K. JhaDiego RomeresJonathan Le Roux
Published in: INTERSPEECH (2023)
Keyphrases