Towards Understanding the Capability of Large Language Models on Code Clone Detection: A Survey.
Shihan Dou, Junjie Shan, Haoxiang Jia, Wenhao Deng, Zhiheng Xi, Wei He, Yueming Wu, Tao Gui, Yang Liu, Xuanjing Huang
Published in: CoRR (2023)
Keyphrases
- language model
- clone detection
- linux kernel
- language modeling
- software reuse
- string matching
- document retrieval
- n gram
- software systems
- language modelling
- probabilistic model
- source code
- information retrieval
- statistical language models
- speech recognition
- retrieval model
- test collection
- query expansion
- context sensitive
- pseudo relevance feedback
- operating system
- relevance model
- document ranking
- smoothing methods
- pattern recognition
- code clones
- ad hoc information retrieval
- machine learning
- translation model
- software engineering
- multi agent
- case study