LeNER-Br: A Dataset for Named Entity Recognition in Brazilian Legal Text.
Pedro Henrique Luz de AraujoTeófilo E. de CamposRenato R. R. de OliveiraMatheus StaufferSamuel CoutoPaulo BermejoPublished in: PROPOR (2018)
Keyphrases
- named entity recognition
- text summarization
- information extraction
- named entities
- named entity disambiguation
- proper names
- natural language processing
- named entity recognizer
- maximum entropy
- text documents
- semi supervised
- conditional random fields
- text mining
- relation extraction
- sequence labeling
- annotated corpus
- classifier ensemble
- benchmark datasets
- feature set
- natural language
- information retrieval
- databases
- co occurrence
- reference resolution
- chinese named entity recognition
- pos taggers
- maximum entropy classifier
- noun phrases
- semantic relations
- question answering
- knowledge representation
- active learning
- keywords
- machine learning