Login / Signup

A cognitive crawler using structure pattern for incremental crawling and content extraction.

Shijia XiFuchun SunJianmin Wang
Published in: IEEE ICCI (2010)
Keyphrases
  • content extraction
  • search engine
  • web pages
  • text content
  • focused crawling
  • web crawling
  • pattern matching
  • website
  • machine learning
  • information retrieval
  • domain knowledge
  • web documents
  • statistical learning
  • web crawler