IITK at SemEval-2024 Task 10: Who is the speaker? Improving Emotion Recognition and Flip Reasoning in Conversations via Speaker Embeddings.
Shubham PatelDivyaksh ShuklaAshutosh ModiPublished in: CoRR (2024)
Keyphrases
- emotion recognition
- speaker verification
- audio visual
- multi modal
- speaker recognition
- emotional speech
- sentiment analysis
- human computer interaction
- noisy environments
- speech recognition
- acoustic features
- multimedia
- emotion classification
- visual information
- facial expressions
- automatic speech recognition
- emotional state
- knowledge base
- image classification
- speaker diarization
- feature vectors
- information extraction
- visual data
- artificial intelligence
- natural language processing