gaHealth: An English-Irish Bilingual Corpus of Health Data.
Séamus LankfordHaithem AfliÓrla Ní LoinsighAndy WayPublished in: CoRR (2024)
Keyphrases
- health data
- parallel corpus
- sentence pairs
- statistical machine translation
- parallel corpora
- health care
- machine translation
- cross lingual
- comparable corpora
- chinese english
- multiword
- cross language information retrieval
- machine translation system
- electronic health records
- word alignment
- english chinese
- cross language
- query translation
- target language
- bilingual dictionaries
- temporal databases
- word pairs
- source language
- clinical trials
- language model
- machine learning
- natural language