Efficient Self-supervised Learning with Contextualized Target Representations for Vision, Speech and Language.
Alexei BaevskiArun BabuWei-Ning HsuMichael AuliPublished in: CoRR (2022)
Keyphrases
- natural language
- dialogue system
- computer vision
- learning process
- learning analytics
- language acquisition
- information retrieval
- multiple representations
- active learning
- online learning
- learning systems
- learning tasks
- multimodal interfaces
- real time
- language learning
- mobile learning
- knowledge acquisition
- vision system
- prior knowledge
- reinforcement learning
- neural network