Tragic Talkers: A Shakespearean Sound- and Light-Field Dataset for Audio-Visual Machine Learning Research.
Davide BerghiMarco VolinoPhilip J. B. JacksonPublished in: CVMP (2022)
Keyphrases
- audio visual
- light field
- machine learning
- multi modal
- sound source
- multi view
- image formation
- super resolution
- visual information
- light field rendering
- multi stream
- visual data
- multimedia
- audio visual speech recognition
- pattern recognition
- active learning
- computer vision
- text classification
- text mining
- data analysis
- feature selection