Integrating sequence information in the audio-visual detection of word prominence in a human-machine interaction scenario.

Published in: INTERSPEECH (2014)

Keyphrases