Sharing Copies of Synthetic Clinical Corpora without Physical Distribution - A Case Study to Get Around IPRs and Privacy Constraints Featuring the German JSYNCC Corpus.
Christina LohrSven BuechelUdo HahnPublished in: LREC (2018)
Keyphrases
- physical constraints
- real world
- privacy preserving
- text corpora
- annotated corpus
- parallel corpus
- document corpus
- text corpus
- personal information
- statistical machine translation
- private information
- wide coverage
- text collections
- text data
- clinical data
- clinical practice
- data sharing
- manually annotated
- information sharing
- text categorization
- natural language processing
- word frequency
- probability distribution
- natural language
- case study