Logic Wrappers and XSLT Transformations for Tuples Extraction from HTML.
Costin Badica, Amelia Badica
Published in: XSym (2005)
Keyphrases
- html documents
- web information extraction
- news pages
- automatic extraction
- information extraction
- content extraction
- semi structured
- web pages
- web documents
- data extraction
- web news
- web content
- textual content
- semantic information
- semistructured data
- structured documents
- modal logic
- xml documents
- html pages
- news articles
- classical logic
- extraction rules
- machine learning
- wrapper induction
- semi structured data
- website
- logic programming
- data streams
- search engine
- markup language
- web data
- structured data