Metadata-Based Clustering and Selection of Metadata Items for Similar Dataset Discovery and Data Combination Tasks.
Takeshi Sakumoto, Teruaki Hayashi, Hiroki Sakaji, Hirofumi Nonaka
Published in: IEEE Access (2024)
Keyphrases
- metadata
- data sets
- database
- digital libraries
- earth science
- data points
- categorical data
- databases
- multidimensional data
- data mining tasks
- data collection
- data objects
- data repositories
- information resources
- high dimensional data
- knowledge discovery
- learning objects
- k means
- data analysis
- data structure
- earth science data
- scientific publications
- digital collections
- geo referenced
- clustering algorithm
- training data
- statistical information
- synthetic datasets
- data reduction
- data sources
- original data
- training dataset
- heterogeneous data
- geospatial data
- heterogeneous sources
- semantic data
- data mining
- geographic data
- environmental data
- data clustering
- synthetic data