Sign in

Weakly-Supervised Speech-to-Text Mapping with Visually Connected Non-Parallel Speech-Text Data Using Cyclic Partially-Aligned Transformer.

Johanes EffendiSakriani SaktiSatoshi Nakamura
Published in: Interspeech (2021)
Keyphrases