CDJUR-BR - A Golden Collection of Legal Document from Brazilian Justice with Fine-Grained Named Entities.
Antônio MauricioVládia PinheiroVasco FurtadoJoão Araújo Monteiro NetoFrancisco das Chagas Jucá BomfimAndré Câmara Ferreira da CostaRaquel de V. SilveiraNilsiton AragãoPublished in: CoRR (2023)
Keyphrases
- fine grained
- named entities
- text documents
- text corpus
- document collections
- coarse grained
- global context
- text collections
- named entity recognition
- noun phrases
- information extraction
- question answering
- named entity extraction
- text mining
- co occurrence
- news corpus
- relation extraction
- access control
- natural language processing
- information retrieval
- web documents
- information retrieval systems
- annotated corpus
- database
- unsupervised learning
- document retrieval
- tf idf
- document clustering
- test collection
- probabilistic model
- image segmentation
- relevant documents
- retrieval systems
- word sense