Topic modeling of public repositories at scale using names in source code.
Vadim MarkovtsevEiso KantPublished in: CoRR (2017)
Keyphrases
- source code
- topic modeling
- topic models
- open source
- software systems
- latent dirichlet allocation
- text mining
- software projects
- software repositories
- open source software
- software maintenance
- text classification
- open source projects
- text files
- mining software repositories
- high level
- open source software projects
- software evolution
- collaborative filtering
- version control
- text documents
- named entities
- website
- address these issues
- software quality
- image classification
- plagiarism detection
- free software
- source files
- information retrieval