Filtering Web Documents for eDot, a food risk warehouse.
Amar-Djalil MezaourPublished in: International Conference on Computational Intelligence (2004)
Keyphrases
- web documents
- information extraction
- semi structured
- data warehouse
- document classification
- web pages
- web search engines
- textual information
- web content
- keywords
- web data
- html documents
- focused crawling
- vector space model
- machine learning
- dynamically generated
- document representation
- text mining
- databases
- unstructured documents