WASTK: A Weighted Abstract Syntax Tree Kernel Method for Source Code Plagiarism Detection.
Deqiang Fu, Yanyan Xu, Haoran Yu, Boyang Yang
Published in: Sci. Program. (2017)
Keyphrases
- plagiarism detection
- kernel methods
- tree kernels
- source code
- high level
- kernel function
- semantic role labeling
- relation extraction
- open source
- machine learning
- support vector machine
- feature space
- structural features
- tree structures
- structured data
- authorship attribution
- tree structure
- natural language
- support vector
- learning tasks
- parse tree
- website
- data mining
- cross language
- automatic extraction
- feature set
- semi supervised
- data sets