Collection Methods and Data Characteristics of the PagkataoKo Dataset.
Edward TigheLuigi AcordaAlexander Ii AgnoJesah GanoTimothy GoGabriel SantiagoClaude SedilloPublished in: PACLIC (2022)
Keyphrases
- data analysis
- data sets
- data mining techniques
- database
- high dimensional data
- statistical methods
- data processing
- data mining methods
- computer systems
- benchmark datasets
- data sources
- high quality
- knowledge discovery
- image data
- data representations
- data structure
- raw data
- statistical tests
- noisy data
- sampling methods
- predictive model
- massive datasets
- complex structures
- data reduction
- multiple sources
- data mining applications
- data quality
- original data
- databases
- data distribution
- decision trees
- data collection
- training data
- prior knowledge
- preprocessing