Co-Training of Audio and Video Representations from Self-Supervised Temporal Synchronization.
Bruno KorbarDu TranLorenzo TorresaniPublished in: CoRR (2018)
Keyphrases
- co training
- media streams
- multimedia
- audio video
- semi supervised
- multi view
- semi supervised learning
- single view
- temporal information
- unlabelled data
- unlabeled data
- music score
- text classification
- labeled data
- training examples
- supervised learning
- video data
- video sequences
- email classification
- audio features
- video frames
- pairwise
- named entities
- machine learning
- gaussian process
- audio visual
- d objects
- similarity measure