Leveraging Corpus Metadata to Detect Template-based Translation: An Exploratory Case Study of the Egyptian Arabic Wikipedia Edition.
Saied Alshahrani, Hesham Haroon, Ali Elfilali, Mariama Njie, Jeanna Matthews
Published in: CoRR (2024)
Keyphrases
- metadata
- digital libraries
- statistical machine translation
- named entity disambiguation
- parallel corpora
- Wikipedia articles
- natural language text
- detection method
- semantic information
- machine translation
- multimedia
- detection algorithm
- learning objects
- parallel corpus
- world knowledge
- knowledge base
- machine translation system
- English words
- unknown words
- document corpus
- text corpus
- database
- semantic relations
- document collections
- WordNet
- link structure
- topic tracking
- databases