bioPDFX: preparing PDF scientific articles for biomedical text mining.
Shitij Bhargava, Tsung-Ting Kuo, Ankit Goyal, Vincent Kuri, Gordon Lin, Chun-Nan Hsu
Published in: PeerJ Prepr. (2017)
Keyphrases
- scientific articles
- biomedical text mining
- text mining
- scientific literature
- topic modeling
- probability density function
- semi-automated
- topic models
- latent dirichlet allocation
- text documents
- biomedical literature
- natural language processing
- data mining
- document clustering
- text classification
- information extraction
- real world
- text processing
- prior knowledge
- knowledge base
- information retrieval