Source Code Author Identification Based on N-gram Author Profiles.
Georgia Frantzeskou, Efstathios Stamatatos, Stefanos Gritzalis, Sokratis K. Katsikas
Published in: AIAI (2006)
Keyphrases
- n-gram
- source code
- author identification
- language independent
- language model
- authorship attribution
- open source
- software systems
- text classification
- highly skewed
- software maintenance
- software projects
- software evolution
- language modeling
- part of speech
- plagiarism detection
- software repositories
- query expansion
- text categorization
- web documents
- website
- machine learning
- high level
- decision trees
- probabilistic model