GGPONC 2.0 - The German Clinical Guideline Corpus for Oncology: Curation Workflow, Annotation Policy, Baseline NER Taggers.
Florian BorchertChristina LohrLuise ModersohnJonas WittThomas LangerMarkus FollmannMatthias GietzeltBert ArnrichUdo HahnMatthieu-P. SchapranowPublished in: LREC (2022)
Keyphrases
- annotated corpus
- computer interpretable
- named entity recognition
- pos tagging
- semi automatically
- named entities
- clinical guidelines
- genia corpus
- automatic annotation
- maximum entropy classifier
- patient data
- part of speech
- information extraction
- relation extraction
- semantic annotation
- natural language processing
- modeling language
- inter annotator agreement
- maximum entropy
- conditional random fields
- ischemic stroke
- optimal policy
- semi supervised
- metadata creation
- clinical trials
- word segmentation
- clinical decision support systems
- clinical practice
- text summarization
- clinical data
- co occurrence
- dependency parsing
- clinical practice guidelines
- machine translation
- semi automatic
- medical knowledge
- machine learning
- chinese named entity recognition
- temporal constraints
- intraoperative
- free text
- active learning
- medical data
- health care
- n gram
- text mining
- digital libraries