Unsupervised Data Augmentation for Less-Resourced Languages with no Standardized Spelling.
Alice MillourKarën FortPublished in: RANLP (2019)
Keyphrases
- data sets
- data analysis
- image data
- data sources
- database
- synthetic data
- data points
- data quality
- raw data
- data distribution
- spatial data
- input data
- databases
- data structure
- training data
- high quality
- bayesian networks
- data objects
- original data
- information retrieval
- expressive power
- sensor data
- end users
- data collection
- data processing
- small number
- training set
- knowledge discovery
- active learning