Audio Transformers: Transformer Architectures For Large Scale Audio Understanding. Adieu Convolutions.
Prateek Verma, Jonathan Berger
Published in: CoRR (2021)
Keyphrases
- multimedia
- signal processing
- audio stream
- audio video
- audio signals
- audio visual
- visual data
- real time
- small scale
- fuzzy logic
- real life
- search engine
- cross modal
- audio signal
- real world
- music score
- neural network
- multi modal
- visual information
- edge detection
- digital video
- music information retrieval
- broadcast news
- digital audio