Collaborative Information Extraction and Mining from Multiple Web Documents.
Tak-Lam WongWai LamShing-Kit ChanPublished in: SDM (2006)
Keyphrases
- web documents
- information extraction
- text mining
- web logs
- semi structured
- unstructured documents
- web mining
- natural language processing
- web search engines
- textual data
- named entities
- document classification
- textual information
- web pages
- structured data
- mining algorithm
- web content
- text documents
- data mining techniques
- web data
- information retrieval
- vector space model
- relation extraction
- html documents
- knowledge discovery
- frequent itemsets
- natural language
- data extraction
- focused crawling
- text classification