Training Data Augmentation for Code-Mixed Translation.
Abhirut GuptaAditya VavreSunita SarawagiPublished in: NAACL-HLT (2021)
Keyphrases
- training data
- data sets
- supervised learning
- decision trees
- training set
- source code
- test data
- machine translation
- training process
- learning algorithm
- code generation
- labeled data
- classification models
- noisy data
- prior knowledge
- test set
- training examples
- training samples
- classification accuracy
- domain knowledge
- class labels
- semi supervised learning
- cross language information retrieval
- statistical machine translation
- training corpus
- learned from training data