An Adaptive Learning Method for Solving the Extreme Learning Rate Problem of Transformer.
Jianbang DingXuancheng RenRuixuan LuoPublished in: NLPCC (1) (2023)
Keyphrases
- learning rate
- delta bar delta
- convergence rate
- learning algorithm
- high accuracy
- fuzzy logic
- genetic algorithm
- error function
- training algorithm
- multi class
- dynamic programming
- pairwise
- reinforcement learning
- optimization algorithm
- optimization method
- sensitivity analysis
- multi objective
- hidden layer
- multilayer neural networks
- neural network