Login / Signup
CENTAURUS: A Dynamic Parser Generator for Parallel Ad Hoc Data Extraction.
Shigeyuki Sato
Hiroka Ihara
Kenjiro Taura
Published in:
J. Inf. Process. (2020)
Keyphrases
</>
data extraction
semi structured
web data extraction
data integration
web sources
web pages
information extraction
data mining
email
data model
knn
natural language
natural language processing
information retrieval systems
similarity measure
website
artificial intelligence
machine learning
html pages
databases