Extracting Body Text from Academic PDF Documents for Text Mining.
Changfeng YuCheng ZhangJie WangPublished in: KDIR (2020)
Keyphrases
- text mining
- text documents
- pdf documents
- textual documents
- textual data
- text clustering
- scientific documents
- information retrieval
- natural language processing
- text data
- knowledge discovery
- information extraction
- text classification
- document clustering
- text analytics
- text processing
- biomedical literature
- scientific literature
- data mining
- text corpora
- data analysis
- machine learning
- databases
- text retrieval
- automatic extraction
- named entities
- knowledge representation
- web content
- document retrieval
- computational linguistics