The unreasonable effectiveness of traditional information retrieval in crash report deduplication.
Joshua Charles CampbellEddie Antonio SantosAbram HindlePublished in: MSR (2016)
Keyphrases
- decision trees
- information retrieval
- search engine
- real time
- ir evaluation
- information retrieval systems
- data mining
- artificial intelligence
- information systems
- text mining
- data sets
- source code
- text categorization
- question answering
- test collection
- learning to rank
- language processing
- record linkage
- database
- probability ranking principle