Login / Signup
Improving Speech-Based End-of-Turn Detection Via Cross-Modal Representation Learning with Punctuated Text Data.
Ryo Masumura
Mana Ihori
Tomohiro Tanaka
Atsushi Ando
Ryo Ishii
Takanobu Oba
Ryuichiro Higashinaka
Published in:
ASRU (2019)
Keyphrases
</>
text data
perceptual information
cross modal
learning algorithm
visual recognition
active learning
image representation
database
co occurrence
structured data
multi modal
text classification
supervised learning
image data
high dimensional
multiscale
training data
data mining