MAE-AST: Masked Autoencoding Audio Spectrogram Transformer.
Alan Baade, Puyuan Peng, David Harwath
Published in: INTERSPEECH (2022)
Keyphrases
- multimedia
- fuzzy logic
- signal processing
- audio video
- pattern analysis
- visual data
- audio stream
- fault diagnosis
- multimedia information
- audio signals
- power system
- audio visual
- artificial intelligence
- power transformers
- audio recordings
- visual information
- high voltage
- audio signal
- neural network
- music score
- cepstral features
- condition monitoring
- audio features
- digital video
- speech signal
- correlation coefficient
- metadata
- e-learning