CAT-LM: Training Language Models on Aligned Code And Tests.
Nikitha Rao, Kush Jain, Uri Alon, Claire Le Goues, Vincent J. Hellendoorn
Published in: CoRR (2023)
Keyphrases
- language model
- language modeling
- speech recognition
- n-gram
- probabilistic model
- retrieval model
- context-sensitive
- document retrieval
- information retrieval
- language modelling
- query expansion
- query terms
- training set
- test collection
- translation model
- statistical language models
- statistical language modeling
- smoothing methods
- document ranking
- vector space model
- passage retrieval
- relevance model
- term dependencies
- information retrieval systems
- co-occurrence
- clustering algorithm