Clustering Source Code Elements by Semantic Similarity Using Wikipedia.
Mirco Schindler, Oliver Fox, Andreas Rausch
Published in: RAISE@ICSE (2015)
Keyphrases
- source code
- semantic similarity
- open source
- co-occurrence
- software systems
- similarity measure
- open source software
- WordNet
- semantic similarity computation
- software projects
- clustering method
- semantically similar
- software maintenance
- vector space model
- semantic information
- semantic features
- k-means
- software evolution
- high level
- free software
- sentence similarity
- gene ontology
- software repositories
- plagiarism detection
- document clustering
- text files
- bug localization
- search engine
- word sense disambiguation
- impact analysis
- program understanding
- maintenance activities
- source files
- data mining