PDFDataExtractor: A Tool for Reading Scientific Text and Interpreting Metadata from the Typeset Literature in the Portable Document Format.
Miao Zhu, Jacqueline M. Cole
Published in: J. Chem. Inf. Model. (2022)
Keyphrases
- metadata
- scientific literature
- scientific publications
- digital documents
- digital libraries
- scientific papers
- database
- text mining
- databases
- google scholar
- web documents
- text documents
- free text
- data mining
- semantic content
- lightweight
- learning resources
- scientific documents
- multimedia
- keywords
- open access
- software tools
- reading comprehension
- earth science
- multimedia documents
- information retrieval
- audio content
- search tools