A CRNN-GRU Based Reinforcement Learning Approach to Audio Captioning.
Xuenan Xu, Heinrich Dinkel, Mengyue Wu, Kai Yu
Published in: DCASE (2020)
Keyphrases
- reinforcement learning
- multimedia
- function approximation
- visual information
- state space
- learning algorithm
- Markov decision processes
- model free
- reinforcement learning algorithms
- robotic control
- signal processing
- optimal policy
- transfer learning
- visual data
- temporal difference learning
- text graphics
- audio stream
- cepstral features
- audio recordings
- policy search
- audio video
- reinforcement learning methods
- broadcast news
- audio features
- Markov decision process
- emotion recognition
- visual features
- image sequences
- feature selection