Duplication Detection for Software Bug Reports Based on BM25 Term Weighting.
Cheng-Zen YangHung-Hsueh DuSin-Sian WuIng-Xiang ChenPublished in: TAAI (2012)
Keyphrases
- term weighting
- bug reports
- information retrieval
- text retrieval
- text categorization
- open source projects
- source code
- tf idf
- software maintenance
- term frequency
- software projects
- language modeling
- retrieval systems
- software development
- term weighting schemes
- term weights
- software systems
- vector space model
- weighting schemes
- open source
- query terms
- software developers
- text classification
- software artifacts
- software design
- software engineers
- software evolution
- software engineering
- feature selection
- average precision
- software architecture
- software components
- retrieval model
- development process