Non-Vacuous Generalization Bounds for Large Language Models.
Sanae LotfiMarc FinziYilun KuangTim G. J. RudnerMicah GoldblumAndrew Gordon WilsonPublished in: CoRR (2023)
Keyphrases
- language model
- generalization bounds
- data dependent
- learning theory
- generalization ability
- language modeling
- model selection
- linear classifiers
- n gram
- ranking algorithm
- statistical learning theory
- learning problems
- document retrieval
- vc dimension
- probabilistic model
- query expansion
- test collection
- information retrieval
- ranking functions
- learning machines
- smoothing methods
- kernel machines
- prediction accuracy
- supervised learning
- active learning
- lower bound
- support vector
- feature selection