A Survey on Web Content Mining and Extraction of Structured and Semistructured Data.
Kshitija Pol, Nita Patil, Shreya Patankar, Chhaya Das
Published in: ICETET (2008)
Keyphrases
- semistructured data
- web content mining
- structured data
- web data extraction
- semi structured
- information extraction
- data extraction
- web mining
- data model
- query language
- regular expressions
- semistructured databases
- web data
- regular path queries
- raw data
- information retrieval
- xml documents
- web documents
- xml databases
- heterogeneous data
- database systems
- pattern matching
- pattern mining
- relational databases
- tree patterns
- website
- metadata
- machine learning