Towards Building More Robust NER datasets: An Empirical Study on NER Dataset Bias from a Dataset Difficulty View.
Ruotian Ma, Xiaolei Wang, Xin Zhou, Qi Zhang, Xuanjing Huang
Published in: EMNLP (2023)
Keyphrases
- named entity recognition
- synthetic datasets
- benchmark datasets
- maximum entropy
- information extraction
- named entities
- training dataset
- real life
- database
- UCI datasets
- artificial intelligence
- massive datasets
- image dataset
- high dimensional datasets
- natural language processing
- text summarization
- multiple views
- conditional random fields
- feature set
- million images
- standard learning algorithms