A Corpus of Scientific Biomedical Texts Spanning over 168 Years Annotated for Uncertainty.
Ramona Bongelli, Carla Canestrari, Ilaria Riccioni, Andrzej Zuczkowski, Cinzia Buldorini, Ricardo Pietrobon, Alberto Lavelli, Bernardo Magnini
Published in: LREC (2012)
Keyphrases
- genia corpus
- text mining
- named entities
- medline abstracts
- annotated corpus
- manually annotated
- named entity recognition
- scientific literature
- scientific papers
- relation extraction
- text documents
- information extraction
- years ago
- natural language text
- data mining
- natural language processing
- robust optimization
- text classification
- environmental sciences
- co-occurrence
- information extraction systems
- artificial intelligence
- biomedical literature
- automatic annotation
- question answering
- newspaper articles
- knowledge discovery
- inter-annotator agreement