Performance of Data Augmentation Methods for Brazilian Portuguese Text Classification.
Marcellus Amadeus, Paulo Branco
Published in: CoRR (2023)
Keyphrases
- text classification
- data sets
- data mining methods
- noisy data
- high dimensional data
- data processing
- statistical methods
- data mining techniques
- statistical analysis
- training data
- data sources
- knowledge discovery
- image data
- input data
- database
- spectral clustering
- feature extraction
- missing values
- data points
- text data
- decision trees
- data collection
- data analysis
- information theoretic
- bag of words
- missing data
- human experts
- data quality
- databases
- significant improvement