Building a comparable corpus and a benchmark for Spanish medical text simplification.
Leonardo Campillos LlanosAna R. Terroba ReinaresSofía Zakhir PuigAna Valverde-MateosAdrián Capllonch-CarriónPublished in: Proces. del Leng. Natural (2022)
Keyphrases
- supervised machine learning
- machine translation system
- broad coverage
- open domain
- spanish language
- text data
- text corpora
- document corpus
- question answering
- topic segmentation
- medical diagnosis
- recognizing textual entailment
- information retrieval
- sentence level
- text corpus
- newspaper articles
- text mining
- plain text
- lexical features
- linguistic patterns
- specific domains
- medical imaging
- anaphora resolution
- english words
- scientific papers
- multiresolution
- training corpus
- medical domain
- natural language text
- medical data
- world knowledge
- temporal expressions
- document level
- multiword
- medical knowledge
- text retrieval
- medical information
- keywords
- text classification
- part of speech
- text documents