Efficient ML Lifecycle Transferring for Large-Scale and High-Dimensional Data via Core Set-Based Dataset Similarity.
Van-Duc LeTien-Cuong BuiWen-Syan LiPublished in: IEEE Access (2023)
Keyphrases
- high dimensional data
- high dimensional
- data sets
- high dimensional datasets
- low dimensional
- subspace clustering
- high dimensionality
- high dimensions
- data points
- dimensionality reduction
- input space
- original data
- clustering high dimensional data
- database
- manifold learning
- dimension reduction
- input data
- feature set
- nearest neighbor
- similarity search
- subspace learning
- decision trees
- nonlinear dimensionality reduction
- real world