SunBear at WNUT-2020 Task 2: Improving BERT-Based Noisy Text Classification with Knowledge of the Data domain.
Linh Doan Bao, Viet Anh Nguyen, Quang Pham Huu
Published in: W-NUT@EMNLP (2020)
Keyphrases
- text classification
- data sets
- raw data
- domain experts
- data mining techniques
- noisy data
- prior knowledge
- knowledge discovery
- data collection
- data analysis
- high quality
- text data
- knowledge management
- database
- human experts
- information retrieval
- missing data
- data mining
- data quality
- expert knowledge
- data structure
- image data
- domain knowledge
- active learning
- data cleaning
- general knowledge
- erroneous data
- labeled data
- background knowledge
- data integration
- unlabeled data
- expert systems
- data points
- probability distribution
- data sources