Keyphrases
- data sets
- data analysis
- original data
- data collection
- database
- image data
- end users
- data sources
- data points
- knowledge discovery
- website
- data processing
- data objects
- web data
- synthetic data
- web documents
- information sources
- data structure
- web applications
- training data
- high quality
- prior knowledge
- statistical analysis
- essential information
- web communities
- constantly growing
- information retrieval
- raw data
- clustering algorithm
- xml documents