Login / Signup
Exploring speaker enrolment for few-shot personalisation in emotional vocalisation prediction.
Andreas Triantafyllopoulos
Meishu Song
Zijiang Yang
Xin Jing
Björn W. Schuller
Published in:
CoRR (2022)
Keyphrases
</>
prediction accuracy
speech recognition
prediction error
video sequences
key frames
prediction algorithm
neural network
pattern recognition
artificial neural networks
mobile devices
visual features
change detection
prediction model
semantic web technologies
audio visual
video shots