Improving topic model source code summarization.
Paul W. McBurneyCheng LiuCollin McMillanTim WeningerPublished in: ICPC (2014)
Keyphrases
- source code
- topic models
- latent dirichlet allocation
- open source
- topic modeling
- software systems
- text documents
- text mining
- latent topics
- probabilistic model
- software projects
- co occurrence
- probabilistic topic models
- high level
- software maintenance
- plagiarism detection
- software evolution
- free software
- latent topic model
- generative model
- software repositories
- text files
- artificial intelligence
- databases
- information extraction
- program understanding
- prior knowledge