Audio Captioning Transformer.
Xinhao MeiXubo LiuQiushi HuangMark D. PlumbleyWenwu WangPublished in: CoRR (2021)
Keyphrases
- multimedia
- fuzzy logic
- audio visual
- fault diagnosis
- visual information
- visual data
- audio signals
- neural network
- cross modal
- high voltage
- signal processing
- audio video
- audio files
- power transformers
- cepstral features
- music genre classification
- audio stream
- database
- broadcast news
- power system
- feature selection
- digital video
- video data
- decision making
- artificial intelligence
- media streams
- databases