Subtopic mining using simple patterns and hierarchical structure of subtopic candidates from web documents.
Se-Jong Kim, Jong-Hyeok Lee
Published in: Inf. Process. Manag. (2015)
Keyphrases
- web documents
- hierarchical structure
- web pages
- web logs
- web directories
- hierarchical structures
- hierarchical classification
- web search engines
- information extraction
- hierarchically structured
- query logs
- pattern mining
- frequent patterns
- hierarchical organization
- semi-structured
- data mining techniques
- keywords
- vector space model
- html documents
- sequential patterns
- mining frequent
- web content
- search engine
- website
- text mining
- image representation
- web search
- tree structure
- web mining
- mining algorithm
- web data
- test collection
- link structure
- information retrieval
- text classification
- frequent pattern mining
- query processing
- feature space
- multiscale
- metadata
- computer vision