On the Economics of Multilingual Few-shot Learning: Modeling the Cost-Performance Trade-offs of Machine Translated and Manual Data.
Kabir AhujaMonojit ChoudhurySandipan DandapatPublished in: NAACL-HLT (2022)
Keyphrases
- data sets
- data processing
- trade off
- prior knowledge
- learning algorithm
- learning systems
- data sources
- data points
- synthetic data
- data collection
- database
- learning models
- high quality
- spatial data
- background knowledge
- human experts
- storage space
- image data
- data mining techniques
- knowledge discovery
- probability distribution
- semi automatic
- end users
- data quality
- reinforcement learning
- training data
- labor intensive