Login / Signup
Text and Layout Information Extraction from Document Files of Various Formats Based on the Analysis of Page Description Language.
Takashi Hirano
Yuichi Okano
Yasuhiro Okada
Fumio Yoda
Published in:
ICDAR (2007)
Keyphrases
</>
description language
information extraction
text documents
information retrieval
document layout
document analysis
keywords
page layout
text mining
web documents
page layout analysis
website
software architecture
web pages
document collections
high level
search engine
machine learning
logic programs
free text
metadata