An Informative DOM Subtree Identification Method from Web Pages in Unfamiliar Web Sites.
Masanobu TsurutaHiroyuki SakaiShigeru MasuyamaPublished in: IEICE Trans. Inf. Syst. (2008)
Keyphrases
- website
- web pages
- pairwise
- cost function
- tree structure
- detection method
- high accuracy
- preprocessing
- experimental evaluation
- support vector machine
- dynamically generated
- web server
- high precision
- web documents
- computational cost
- significant improvement
- relational databases
- computational complexity
- clustering method
- data model
- web content
- video sequences