Login / Signup
Integrating sequence information in the audio-visual detection of word prominence in a human-machine interaction scenario.
Andrea Schnall
Martin Heckmann
Published in:
INTERSPEECH (2014)
Keyphrases
</>
audio visual
human machine interaction
multi modal
visual data
visual information
temporal context
information extraction
contextual information
keywords
object recognition
intelligent systems
higher level