Clustering Heterogeneous Semi-structured Social Science Datasets.
David B. SkillicornChristian LeuprechtPublished in: ICCS (2015)
Keyphrases
- semi structured
- social sciences
- structured data
- data collections
- semi structured data
- information integration
- computer science
- clustering algorithm
- data extraction
- data model
- information extraction
- digital government
- text mining
- web documents
- free text
- web data
- social scientists
- unstructured data
- digital archiving
- k means
- knowledge rich
- semi structured documents
- wrapper generation
- database
- web data extraction
- diverse fields
- database systems
- metadata
- machine learning
- content and structure
- databases
- web data sources