Login / Signup
Integrating Self-supervised Speech Model with Pseudo Word-level Targets from Visually-grounded Speech Model.
Hung-Chieh Fang
Nai-Xuan Ye
Yi-Jen Shih
Puyuan Peng
Hsuan-Fu Wang
Layne Berry
Hung-yi Lee
David Harwath
Published in:
CoRR (2024)
Keyphrases
</>
context dependent
information retrieval
statistical model