AUG-BERT: An Efficient Data Augmentation Algorithm for Text Classification.
Linqing Shi, Danyang Liu, Gongshen Liu, Kui Meng
Published in: CSPS (2019)
Keyphrases
- input data
- noisy data
- text classification
- data sets
- data processing
- detection algorithm
- computational complexity
- data sources
- search space
- dynamic programming
- computationally efficient
- k-means
- segmentation algorithm
- learning algorithm
- synthetic datasets
- data cleaning
- data reduction
- data mining techniques
- knowledge discovery
- worst case
- machine learning
- information loss
- data quality
- n-gram
- optimal solution
- data analysis
- XML documents
- expectation maximization
- classification algorithm
- missing values
- training data
- probabilistic model
- support vector machine
- feature selection
- text mining
- database