• search
    search
  • reviewers
    reviewers
  • feeds
    feeds
  • assignments
    assignments
  • settings
  • logout

Multi-Modal Learning for Speech Emotion Recognition: An Analysis and Comparison of ASR Outputs with Ground Truth Transcription.

Saurabh SahuVikramjit MitraNadee SeneviratneCarol Y. Espy-Wilson
Published in: INTERSPEECH (2019)
Keyphrases
  • multi modal
  • ground truth
  • statistical analysis
  • multi modality
  • high quality
  • image analysis
  • semantic concepts
  • multimedia
  • video analysis
  • video search
  • cross modal