Login / Signup
Disambiguating Speech Intention via Audio-Text Co-attention Framework: A Case of Prosody-semantics Interface.
Won-Ik Cho
Jeonghwa Cho
Woo Hyun Kang
Nam Soo Kim
Published in:
CoRR (2019)
Keyphrases
</>
text to speech
speech synthesis
multimedia
speech recognition
audio visual
prosodic features
logical framework
multi stream
information retrieval
user interface
text recognition
spontaneous speech
text input
text graphics