A Graph-Based Framework for Web Document Mining.
Adam Schenker, Horst Bunke, Mark Last, Abraham Kandel
Published in: Document Analysis Systems (2004)
Keyphrases
- web documents
- web logs
- prefetching
- information extraction
- web pages
- web content
- semi-structured
- web mining
- web usage mining
- web search engines
- vector space model
- document classification
- textual information
- text mining
- keywords
- data mining techniques
- association rule mining
- knowledge discovery
- sequential patterns
- data structure
- html documents
- document representation
- dynamically generated
- data mining
- unstructured documents
- mining algorithm
- pattern mining
- data mining algorithms
- itemsets
- website
- search engine
- machine learning