DOM based content extraction via text density.
Fei Sun
Dandan Song
Lejian Liao
Published in:
SIGIR (2011)
Keyphrases
content extraction
text content
html documents
web news
web pages
web documents
digital archives
xml documents
semantic information
keywords
text documents
text retrieval
website
information retrieval
machine learning
relational databases
domain knowledge
database
multimedia information retrieval