Towards Generalizable SER: Soft Labeling and Data Augmentation for Modeling Temporal Emotion Shifts in Large-Scale Multilingual Speech.
Mohamed OsmanTamer NadeemGhada KhoribaPublished in: CoRR (2023)
Keyphrases
- data collection
- high quality
- database
- temporal data
- training data
- data analysis
- data points
- temporal evolution
- data sets
- data quality
- raw data
- labeled data
- data processing
- active learning
- video sequences
- data sources
- spatio temporal
- high dimensional data
- synthetic data
- speech recognition
- temporal information
- data structure
- log data
- learning algorithm
- multivariate time series
- real world
- labeling process