Comparing the Archival Rate of Arabic, English, Danish, and Korean Language Web Pages.
Lulwah M. AlkwaiMichael L. NelsonMichele C. WeiglePublished in: ACM Trans. Inf. Syst. (2017)
Keyphrases
- arabic language
- web pages
- english language
- language learning
- machine translation system
- word forms
- natural language
- english text
- language identification
- target language
- word order
- morphological analysis
- website
- native language
- language specific
- parallel corpus
- language processing
- search engine
- web search engines
- programming language
- language independent
- source language
- web page classification
- comparable corpora
- text to speech
- machine translation
- digital libraries
- keywords
- indian languages
- web documents
- foreign language
- unknown words
- spoken language
- native speakers
- statistical machine translation
- character n grams
- link analysis
- syntactic categories
- natural language processing