Generating artificial texts as substitution or complement of training data.
Vincent ClaveauAntoine ChaffinEwa KijakPublished in: CoRR (2021)
Keyphrases
- training data
- test data
- data sets
- learning algorithm
- training corpus
- training set
- automatically generating
- real world
- training process
- classification accuracy
- class labels
- learned from training data
- legal texts
- natural language generation
- generalization error
- training examples
- training samples
- keywords
- decision trees
- test set
- naive bayes
- classification models
- input data
- co occurrence
- training dataset
- natural language text
- support vector machine
- training instances
- text segmentation
- machine learning
- data mining