Clickbait? Sensational Headline Generation with Auto-tuned Reinforcement Learning.

Peng Xu, Chien-Sheng Wu, Andrea Madotto, Pascale Fung

Published in: EMNLP/IJCNLP (1) (2019)

Keyphrases

reinforcement learning
function approximation
robot control
state space
markov decision processes
real time
temporal difference
optimal policy
learning process
generation algorithm
decision making
action selection
neural network
reinforcement learning algorithms
reinforcement learning methods
temporal difference learning
multi agent reinforcement learning
policy search
database
robotic control
control problems
dynamical systems
mobile robot
evolutionary algorithm
case study
decision trees
learning algorithm