Temporal Sub-Sampling of Audio Feature Sequences for Automated Audio Captioning.
Khoa NguyenKonstantinos DrossosTuomas VirtanenPublished in: DCASE (2020)
Keyphrases
- multimedia
- visual information
- cepstral features
- signal processing
- spatio temporal
- music score
- visual data
- audio visual
- audio video
- hidden markov models
- temporal information
- audio signals
- audio features
- data sets
- audio recordings
- music information retrieval
- temporal sequences
- event sequences
- semi automated
- temporal patterns
- sample size