DEplain: A German Parallel Corpus with Intralingual Translations into Plain Language for Sentence and Document Simplification.
Regina StoddenOmar MomenLaura KallmeyerPublished in: ACL (1) (2023)
Keyphrases
- parallel corpus
- machine translation
- source language
- query translation
- machine translation system
- parallel texts
- target language
- word alignment
- cross language
- cross lingual
- cross language information retrieval
- language independent
- bilingual dictionaries
- sentence pairs
- document clustering
- document classification
- statistical machine translation
- document collections
- information retrieval
- document retrieval
- latent semantic analysis
- text classification
- word level
- parallel corpora
- keywords
- information retrieval systems
- information extraction
- natural language
- retrieval systems
- query expansion
- text retrieval
- machine learning
- multiword
- translation model
- question answering