Sample Size in Natural Language Processing within Healthcare Research.
Jaya Chaturvedi, Diana Shamsutdinova, Felix Zimmer, Sumithra Velupillai, Daniel Stahl, Robert Stewart, Angus Roberts
Published in: CoRR (2023)
Keyphrases
- sample size
- natural language processing
- model selection
- machine learning
- information extraction
- random sampling
- small sample
- text mining
- upper bound
- covariance matrix
- statistical tests
- confidence intervals
- natural language
- statistical power
- small sample size
- progressive sampling
- worst case
- statistical hypothesis testing
- number of training samples
- random sample
- pac learning
- knowledge representation
- variance reduction
- experimental design
- generalization error
- small samples
- data sets