Pattern discovery for semi-structured web pages using bar-tree representation
Zaenal AkbarLaksana Tri HandokoPublished in: CoRR (2011)
Keyphrases
- semi structured
- tree representation
- pattern discovery
- recursive partitioning
- web documents
- web pages
- data extraction
- web information extraction
- html pages
- web data extraction
- web data
- tree structure
- binary tree
- structured data
- web data sources
- information extraction
- tree structures
- website
- pattern mining
- search engine
- data analysis
- data model
- text mining
- web content
- association rule mining
- data mining
- sequential patterns
- quadtree
- semistructured data
- keywords
- hierarchical structure
- xml databases
- monte carlo
- index structure
- search algorithm