PDFdigest: an Adaptable Layout-Aware PDF-to-XML Textual Content Extractor for Scientific Articles.
Daniel Ferrés
Horacio Saggion
Francesco Ronzano
Àlex Bravo
Published in:
LREC (2018)
Keyphrases
textual content
scientific articles
scientific literature
xml documents
topic modeling
textual information
xml data
news articles
keywords
web pages
metadata
data model
databases
machine learning
knowledge base