Login / Signup
OWSM-CTC: An Open Encoder-Only Speech Foundation Model for Speech Recognition, Translation, and Language Identification.
Yifan Peng
Yui Sudo
Muhammad Shakeel
Shinji Watanabe
Published in:
CoRR (2024)
Keyphrases
</>
speech recognition
speaker identification
speech signal
speech synthesis
language model
hidden markov models
automatic speech recognition
language identification
speech processing
probabilistic model
speaker recognition
speech recognition systems
similarity measure
maximum likelihood
noisy speech