NoisyAG-News: A Benchmark for Addressing Instance-Dependent Noise in Text Classification.
Hongfei HuangTingting LiangXixi SunZikang JinYuyu YinPublished in: CoRR (2024)
Keyphrases
- text classification
- text categorization
- text mining
- news articles
- text documents
- bag of words
- labeled data
- noise level
- signal to noise ratio
- random noise
- feature selection
- noise reduction
- missing data
- data cleaning
- multi label
- naive bayes
- noisy data
- knn
- machine learning
- semantic features
- image noise
- statistically independent
- noise model
- text classifiers
- user generated content
- median filter
- text data
- n gram
- social media
- sentiment analysis
- speech recognition