An ensemble learning approach for software semantic clone detection.
Min FuGang LuoJames Xi ZhengTianyi ZhangDongjin YuMiryung KimPublished in: CoRR (2020)
Keyphrases
- ensemble learning
- clone detection
- software systems
- linux kernel
- software reuse
- string matching
- generalization ability
- ensemble methods
- source code
- random forest
- software engineering
- high level
- software development
- base classifiers
- pattern matching
- concept drift
- unlabeled data
- multi class
- software components
- machine learning
- computer systems
- open source
- feature selection
- learning algorithm