gaHealth: An English-Irish Bilingual Corpus of Health Data.
Séamus Lankford, Haithem Afli, Orla Ni Loinsigh, Andy Way
Published in: LREC (2022)
Keyphrases
- health data
- parallel corpus
- sentence pairs
- statistical machine translation
- parallel corpora
- machine translation
- cross lingual
- health care
- multiword
- chinese english
- comparable corpora
- cross language information retrieval
- machine translation system
- word alignment
- cross language
- english chinese
- query translation
- source language
- electronic health records
- word pairs
- bilingual dictionaries
- target language
- temporal databases
- databases
- clinical trials
- natural language processing
- semi automatic
- domain knowledge
- patient care
- expert systems
- natural language
- knowledge base