On automatically tagging web documents from examples.
Nicholas Joel WoodwardWeijia XuKent NorsworthyPublished in: SIGIR (2012)
Keyphrases
- web documents
- information extraction
- web pages
- semi structured
- document classification
- keywords
- web search engines
- textual information
- web content
- focused crawling
- html documents
- training examples
- document representation
- vector space model
- metadata
- information retrieval
- structured documents
- active learning
- topic specific
- unstructured documents