Analyzing Data Selection Techniques with Tools from the Theory of Information Losses.
Brandon Foggo, Nanpeng Yu
Published in: IEEE BigData (2021)
Keyphrases
- raw data
- end users
- data sets
- information sources
- database
- complex data
- data from multiple sources
- huge amounts
- domain knowledge
- background knowledge
- sensor data
- training data
- statistical analysis
- heterogeneous sources
- data sources
- prior knowledge
- web data
- historical data
- domain experts
- information processing
- data mining tools
- data collection
- computer systems
- stored data
- data points
- private information
- multimedia data
- data processing
- global information
- collected data
- high dimensional data
- structural information
- synthetic data
- data repositories
- data quality
- xml documents
- information resources
- data analysis
- information services
- semantic web standards
- heterogeneous data
- external data
- software repositories
- digital data
- missing information
- essential information
- log data
- original data
- statistical methods
- spatial data
- missing data
- user interaction
- input data
- image data
- knowledge discovery
- website
- knowledge base
- feature selection