Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B.
Di ZhangXiaoshui HuangDongzhan ZhouYuqiang LiWanli OuyangPublished in: CoRR (2024)
Keyphrases
- monte carlo
- monte carlo simulation
- monte carlo methods
- learning perl
- importance sampling
- monte carlo tree search
- game tree
- markovian decision
- markov chain
- particle filter
- tree structure
- simulation study
- adaptive sampling
- temporal difference
- tree search
- point processes
- variance reduction
- monte carlo method
- stochastic approximation
- quasi monte carlo
- optimal solution
- decision trees
- matrix inversion
- confidence intervals
- mathematical models
- b tree
- index structure
- dynamic programming