Towards building a Urdu Language Corpus using Common Crawl.
Hafiz Muhammad Shafiq, Bilal Tahir, Muhammad Amir Mehmood
Published in: J. Intell. Fuzzy Syst. (2020)
Keyphrases
- spanish language
- parallel corpus
- natural language
- web search
- specification language
- linguistic patterns
- open domain
- language learning
- programming language
- manually annotated
- sentiment analysis
- text retrieval
- target language
- machine learning
- statistical machine translation
- machine translation system
- language identification
- machine translation
- spoken dialog
- search engine