Dual Transformer Decoder based Features Fusion Network for Automated Audio Captioning.
Jianyuan Sun
Xubo Liu
Xinhao Mei
Volkan Kiliç
Mark D. Plumbley
Wenwu Wang
Published in:
INTERSPEECH (2023)
Keyphrases
feature set
feature vectors
low level
feature extraction
data fusion
multimedia
feature space
multimodal fusion
fuzzy logic
signal processing
power system
end to end
complex networks
low complexity
image fusion
network model
fusion method
audio features