Normalization and Back-Transliteration for Code-Switched Data.
Dwija ParikhThamar SolorioPublished in: CALCS@NAACL (2021)
Keyphrases
- data sets
- data structure
- database
- missing data
- data collection
- training data
- high quality
- complex data
- data analysis
- raw data
- data sources
- historical data
- synthetic data
- computer systems
- sensor networks
- information systems
- bayesian networks
- knowledge discovery
- probability distribution
- small number
- spatial data
- data objects
- original data
- feature space
- statistical methods
- data distribution
- experimental data
- end users
- data points
- data processing
- input data
- image data