Selecting Machine-Translated Data for Quick Bootstrapping of a Natural Language Understanding System.
Judith GaspersPenny KaranasouRajen ChatterjeePublished in: CoRR (2018)
Keyphrases
- data sets
- data quality
- high quality
- data sources
- synthetic data
- data collection
- image data
- training data
- raw data
- data mining techniques
- data distribution
- data points
- application domains
- probability distribution
- database
- original data
- knowledge representation
- noisy data
- experimental data
- prior knowledge
- expert systems
- bayesian networks
- social networks
- real time