Detecting Tables in HTML Documents.
Yalin Wang, Jianying Hu
Published in: Document Analysis Systems (2002)
Keyphrases
- html documents
- web documents
- automatic extraction
- semi structured
- semantic information
- structured documents
- web pages
- semistructured data
- web content
- web page retrieval
- xml documents
- information extraction
- repeated patterns
- structured data
- high level
- knowledge base
- topic maps
- semi structured data
- machine learning
- database