UMUTeam at AI-SOCO'2020: Source Code Authorship Identification based on Character N-Grams and Author's Traits.
José Antonio García-Díaz, Rafael Valencia-García
Published in: FIRE (Working Notes) (2020)
Keyphrases
- source code
- authorship attribution
- open source
- character n-grams
- software systems
- artificial intelligence
- plagiarism detection
- software maintenance
- n-gram
- software projects
- writing style
- software evolution
- high level
- program understanding
- source files
- bug reports
- cross language information retrieval
- software repositories
- variable length
- cross language
- website
- search engine