Learning Optimal Policy for Simultaneous Machine Translation via Binary Search.
Shoutao Guo, Shaolei Zhang, Yang Feng
Published in: ACL (1) (2023)
Keyphrases
- machine translation
- optimal policy
- reinforcement learning
- learning algorithm
- binary search
- infinite horizon
- information extraction
- natural language processing
- language independent
- average reward reinforcement learning
- finite state
- state space
- markov decision processes
- target language
- state dependent
- knowledge representation
- artificial intelligence