• search
    search
  • reviewers
    reviewers
  • feeds
    feeds
  • assignments
    assignments
  • settings
  • logout

Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes.

Cheng-Yu HsiehChun-Liang LiChih-Kuan YehHootan NakhostYasuhisa FujiiAlexander RatnerRanjay KrishnaChen-Yu LeeTomas Pfister
Published in: CoRR (2023)
Keyphrases
  • probabilistic model
  • language model
  • training data
  • statistical model
  • information retrieval
  • document retrieval
  • translation model
  • decision trees
  • retrieval model