ProdExt: A Knowledge-Based Wrapper for Extraction of Technical and Scientific Production in Web Pages.
Carla C. R. Nunes, Flávia A. Barros
Published in: IBERAMIA-SBIA 2000 Open Discussion Track (2000)
Keyphrases
- web information extraction
- web pages
- data extraction
- web data extraction
- information extraction
- website
- web data
- html documents
- wrapper induction
- expert systems
- web content mining
- extraction rules
- semi-structured
- search engine
- web search engines
- production system
- keywords
- web documents
- web users
- scientific data
- technical reports
- production planning
- science education
- production process
- web page classification
- web search
- xml documents
- artificial intelligence
- web server
- graphic design
- wide variety
- web graph
- automatic extraction
- black box