Masked Autoencoders with Multi-Window Attention Are Better Audio Learners.
Sarthak Yadav, Sergios Theodoridis, Lars Kai Hansen, Zheng-Hua Tan
Published in: CoRR (2023)
Keyphrases
- learning experience
- multimedia
- denoising
- learning process
- learning processes
- e learning
- learning resources
- window size
- sliding window
- learning activities
- signal processing
- collaborative learning
- audio visual
- ubiquitous learning
- socio cultural
- positive feedback
- learning styles
- visual features
- visual information
- language learning
- visual data
- teaching materials
- artificial neural networks
- learning companion