Audio Captioning Transformer.
Xinhao MeiXubo LiuQiushi HuangMark D. PlumbleyWenwu WangPublished in: DCASE (2021)
Keyphrases
- multimedia
- visual information
- audio visual
- fuzzy logic
- signal processing
- audio video
- audio stream
- computational intelligence
- audio signals
- power system
- fault diagnosis
- visual data
- music score
- power transformers
- soccer video
- cross modal
- emotion recognition
- multi modal
- image processing
- partial discharge
- music scores
- real time
- incipient fault
- audio recordings
- distribution network
- computer vision
- neural network