Login / Signup
VILA: Improving Structured Content Extraction from Scientific PDFs Using Visual Layout Groups.
Zejiang Shen
Kyle Lo
Lucy Lu Wang
Bailey Kuehl
Daniel S. Weld
Doug Downey
Published in:
Trans. Assoc. Comput. Linguistics (2022)
Keyphrases
</>
content extraction
web news
probability density function
visual features
text content
html documents
data mining
natural language processing