Declarative Information Extraction, Web Crawling, and Recursive Wrapping with Lixto.
Robert Baumgartner, Sergio Flesca, Georg Gottlob
Published in: LPNMR (2001)
Keyphrases
- web crawling
- information extraction
- web data extraction
- web mining
- data extraction
- semi-structured
- web data
- web sources
- deep web
- text mining
- search engine
- focused crawling
- link analysis
- natural language processing
- topic specific
- query interface
- web documents
- web pages
- structured data
- information integration
- data mining and machine learning
- knowledge representation
- application of data mining
- data integration
- web databases
- information retrieval
- data mining
- text categorization
- named entities
- database
- natural language
- machine learning
- data mining techniques
- co-occurrence
- web search
- data model