Iterative Data Programming for Expanding Text Classification Corpora.
Neil MallinarAbhishek ShahTin Kam HoRajendra UgraniAyush GuptaPublished in: CoRR (2020)
Keyphrases
- text classification
- data analysis
- data sets
- raw data
- data collection
- text data
- data structure
- synthetic data
- data sources
- data mining
- natural language processing
- image data
- machine learning
- data mining techniques
- input data
- prior knowledge
- computer systems
- high dimensional data
- xml documents
- missing values
- training data
- data quality