OntoExtractor: A Fuzzy-Based Approach in Clustering Semi-structured Data Sources and Metadata Generation.
Zhan CuiErnesto DamianiMarcello LeidaMarco VivianiPublished in: KES (1) (2005)
Keyphrases
- semi structured
- structured data
- data sources
- metadata
- data model
- data collections
- web data sources
- information integration
- fuzzy clustering
- unstructured data
- data integration
- clustering algorithm
- free text
- web data
- heterogeneous data
- information extraction
- fuzzy sets
- semi structured data
- databases
- clustering method
- search interface
- data warehouse
- structured knowledge
- k means
- text mining
- data repositories
- wrapper generation
- web documents
- information sources
- semi structured documents
- web sources
- database
- web data extraction
- html pages
- content and structure
- digital libraries
- probabilistic xml
- query language
- keywords
- relational databases