Few-shot Task-agnostic Neural Architecture Search for Distilling Large Language Models.
Dongkuan Xu, Subhabrata Mukherjee, Xiaodong Liu, Debadeepta Dey, Wenhui Wang, Xiang Zhang, Ahmed Hassan Awadallah, Jianfeng Gao
Published in: NeurIPS (2022)
Keyphrases
- language model
- neural architecture
- language modeling
- document retrieval
- speech recognition
- n gram
- probabilistic model
- document ranking
- neural network
- test collection
- search space
- language modelling
- retrieval model
- statistical language models
- query specific
- information retrieval
- query expansion
- smoothing methods
- training data
- basis functions
- query terms
- recurrent neural networks
- video data
- feature selection
- genetic algorithm