Ranking-based and Classification-based Approaches for Code Author Identification.
Zhongyuan HanTang LiXiangyu WangYujie XuMenghan WuZhiran LiZhengyu WuYong HanPublished in: FIRE (Working Notes) (2020)
Keyphrases
- author identification
- machine learning
- machine learning methods
- feature vectors
- pattern classification
- decision trees
- machine learning algorithms
- classification accuracy
- supervised learning
- data sets
- support vector machine svm
- class imbalance
- decision rules
- feature selection
- benchmark data sets
- benchmark datasets
- highly skewed
- concept drift
- classification models
- data streams
- prediction accuracy
- training set
- feature space