Toxicity in Multilingual Machine Translation at Scale.
Marta R. Costa-jussàEric SmithChristophe RopersDaniel LichtJavier FerrandoCarlos EscolanoPublished in: CoRR (2022)
Keyphrases
- machine translation
- cross lingual
- language resources
- language independent
- cross language information retrieval
- multilingual documents
- chinese english
- language specific
- machine translation system
- cross lingual information retrieval
- comparable corpora
- language processing
- natural language processing
- information extraction
- parallel corpus
- word sense disambiguation
- natural language
- statistical machine translation
- target language
- parallel corpora
- information retrieval
- word alignment
- natural language generation
- query translation
- cross language
- artificial intelligence
- multilingual information retrieval
- brazilian portuguese
- tasks in natural language processing
- finite state transducers
- bilingual dictionaries
- language modeling
- named entities
- lexical knowledge
- data mining
- bilingual lexicon