Synthesizing Programmatic Reinforcement Learning Policies with Large Language Model Guided Search.
Max LiuChan-Hung YuWei-Hsu LeeCheng-Wei HungYen-Chun ChenShao-Hua SunPublished in: CoRR (2024)
Keyphrases
- language model
- reinforcement learning
- optimal policy
- language modeling
- n gram
- document retrieval
- probabilistic model
- speech recognition
- language modelling
- information retrieval
- state space
- ad hoc information retrieval
- statistical language models
- query expansion
- retrieval model
- test collection
- vector space model
- context sensitive
- mixture model
- query terms
- document ranking
- machine learning
- language models for information retrieval
- relevance model
- translation model
- smoothing methods
- statistical model
- pseudo relevance feedback
- statistical machine translation
- information retrieval systems
- learning algorithm
- word clouds