A Bootstrapping-based Method to Automatically Identify Data-usage Statements in Publications.
Qiuzi Zhang, Qikai Cheng, Yong Huang, Wei Lu
Published in: J. Data Inf. Sci. (2017)
Keyphrases
- synthetic data
- data sets
- input data
- missing data
- high accuracy
- detection method
- computational cost
- image data
- data analysis
- prior knowledge
- data processing
- test data
- training samples
- preprocessing
- data collection
- statistical methods
- high precision
- missing values
- user input
- raw data
- prior information
- noisy data
- data quality
- information loss
- training data
- clustering method
- statistical analysis
- knowledge discovery
- data points
- probability distribution
- cost function
- high quality
- image segmentation
- segmentation method
- database
- data mining techniques
- information extraction
- data sources
- objective function