Tragic Talkers: A Shakespearean Sound- and Light-Field Dataset for Audio-Visual Machine Learning Research.
Davide BerghiMarco VolinoPhilip J. B. JacksonPublished in: CoRR (2022)
Keyphrases
- audio visual
- light field
- machine learning
- multi modal
- sound source
- multi view
- image formation
- visual information
- super resolution
- light field rendering
- visual data
- multi stream
- audio visual speech recognition
- multimedia
- active learning
- text classification
- text mining
- data sets
- pattern recognition
- video sequences
- data analysis
- motion blur
- high quality
- computer vision