Login / Signup
Random Utterance Concatenation Based Data Augmentation for Improving Short-video Speech Recognition.
Yist Y. Lin
Tao Han
Haihua Xu
Van Tung Pham
Yerbolat Khassanov
Tze Yuang Chong
Yi He
Lu Lu
Zejun Ma
Published in:
INTERSPEECH (2023)
Keyphrases
</>
speech recognition
language model
multimedia
video sequences
hidden markov models
speech recognizer
neural network
image processing
face recognition
training data
automatic speech recognition
speaker identification
speech synthesis
speech understanding
keyword spotting