Unmasking Biases: Exploring Gender Bias in English-Catalan Machine Translation through Tokenization Analysis and Novel Dataset.
Audrey Mash, Carlos Escolano, Aleix Sant, Maite Melero, Francesca de Luca Fornaciari
Published in: LREC/COLING (2024)
Keyphrases
- machine translation
- cross lingual
- cross language information retrieval
- target language
- natural language
- statistical machine translation
- information extraction
- language processing
- parallel corpus
- machine translation system
- language independent
- word sense disambiguation
- machine transliteration
- brazilian portuguese
- multilingual information retrieval
- language resources
- tasks in natural language processing
- chinese english
- word alignment
- named entities
- parallel corpora
- information retrieval
- machine learning