Login / Signup
uaMix-MAE: Efficient Tuning of Pretrained Audio Transformers with Unsupervised Audio Mixtures.
Afrina Tabassum
Dung N. Tran
Trung Dang
Ismini Lourentzou
Kazuhito Koishida
Published in:
CoRR (2024)
Keyphrases
</>
multimedia
audio video
audio visual
signal processing
music score
visual information
video sequences
image segmentation
visual data
audio signals
audio features
cepstral features
speaker identification
multimedia information
emotion recognition
digital video
unsupervised learning
image quality