MultiCoNER: A Large-scale Multilingual dataset for Complex Named Entity Recognition.
Shervin Malmasi, Anjie Fang, Besnik Fetahu, Sudipta Kar, Oleg Rokhlenko
Published in: CoRR (2022)
Keyphrases
- named entity recognition
- information extraction
- named entities
- natural language processing
- real world
- text summarization
- maximum entropy
- conditional random fields
- semi-supervised
- annotated corpus
- classifier ensemble
- relation extraction
- proper names
- machine translation
- data sets
- sequence labeling
- feature extraction
- cross-lingual
- machine learning
- benchmark datasets
- information retrieval
- prior knowledge