Sample size determination for biomedical big data with limited labels.
Aaron N. RichterTaghi M. KhoshgoftaarPublished in: Netw. Model. Anal. Health Informatics Bioinform. (2020)
Keyphrases
- sample size
- big data
- unstructured data
- cloud computing
- big data analytics
- business intelligence
- data management
- model selection
- small sample
- data processing
- progressive sampling
- data analysis
- random sampling
- social media
- knowledge discovery
- upper bound
- vast amounts of data
- statistical hypothesis testing
- data science
- statistical power
- pac learning
- text mining
- information extraction
- data warehousing
- variance reduction
- massive datasets
- training data
- information processing
- random sample
- worst case
- active learning
- objective function
- decision trees
- data driven decision making