PDFX: fully-automated PDF-to-XML conversion of scientific literature.
Alexandru Constantin, Steve Pettifer, Andrei Voronkov
Published in: ACM Symposium on Document Engineering (2013)
Keyphrases
- fully automated
- scientific literature
- fully automatic
- text mining
- xml documents
- digital libraries
- scientific papers
- scientific articles
- semi automated
- scientific publications
- metadata
- data model
- text processing
- news video
- biomedical literature
- databases
- completely automated
- video data
- data integration
- information retrieval
- published literature
- keywords
- database