STAVIES: A System for Information Extraction from Unknown Web Data Sources through Automatic Web Wrapper Generation Using Clustering Techniques.
Nikolaos Papadakis, Dimitrios Skoutas, Konstantinos Raftopoulos, Theodora A. Varvarigou
Published in: IEEE Trans. Knowl. Data Eng. (2005)
Keyphrases
- wrapper generation
- web data sources
- semi-structured
- information extraction
- data extraction
- web documents
- web sources
- web information extraction
- structured data
- web data
- information integration
- text mining
- ontology population
- natural language processing
- data model
- semi-structured data
- website
- information retrieval
- machine learning
- web mining
- data management
- information retrieval systems
- keyword queries
- digital libraries
- web pages