Scalable Machine Learning with Granulated Data Summaries: A Case of Feature Selection.
Agnieszka Chadzynska-KrasowskaPawel BetlinskiDominik SlezakPublished in: ISMIS (2017)
Keyphrases
- machine learning
- feature selection
- data sets
- data analysis
- database
- raw data
- knowledge discovery
- data mining techniques
- image data
- data collection
- data distribution
- data structure
- original data
- machine learning methods
- missing values
- synthetic data
- data quality
- machine learning algorithms
- statistical analysis
- feature subset
- knowledge acquisition
- text classification
- input data
- mutual information
- text mining
- data points
- probability distribution
- support vector
- high quality
- training data
- databases