Reward Steering with Evolutionary Heuristics for Decoding-time Alignment.
Chia-Yu HungNavonil MajumderAmbuj MehrishSoujanya PoriaPublished in: CoRR (2024)
Keyphrases
- multiple sequence alignments
- genetic algorithm
- multiple sequence alignment
- search algorithm
- reinforcement learning
- evolutionary optimization
- evolutionary computation
- image alignment
- dynamic programming
- exact algorithms
- decoding algorithm
- metaheuristic
- heuristic search
- search methods
- phylogenetic trees
- genetic search
- data sets