Watch and Learn: Mapping Language and Noisy Real-world Videos with Self-supervision.
Yujie Zhong, Linhai Xie, Sen Wang, Lucia Specia, Yishu Miao
Published in: CoRR (2020)
Keyphrases
- real world
- wide range
- language learning
- data sets
- programming language
- case study
- video sequences
- language processing
- dynamic scenes
- natural language
- video surveillance
- active learning
- noisy data
- synthetic data
- human activities
- ontology mapping
- event detection
- object oriented
- image retrieval
- bayesian networks
- computer vision
- data mining