Large expert-curated database for benchmarking document similarity detection in biomedical literature search.
Peter BrownRELISH ConsortiumYaoqi ZhouPublished in: Database J. Biol. Databases Curation (2019)
Keyphrases
- high dimensional
- biomedical literature
- document similarity
- medical literature
- document clustering
- database
- text mining
- automatic extraction
- database systems
- databases
- protein protein interactions
- cosine similarity
- data model
- probabilistic model
- information extraction
- text documents
- document representation
- metadata
- social networks