Hiencor: On mining of a hi-en general purpose parallel corpus from the web.
Arjun DasUtpal GarainRavindra KumarApurbalal SenapatiPublished in: IALP (2017)
Keyphrases
- general purpose
- parallel corpus
- web mining
- web data
- cross lingual
- text mining
- machine translation
- data mining
- text categorization
- user experience
- web pages
- link analysis
- user generated content
- web documents
- natural language processing
- probabilistic model
- keywords
- cross language information retrieval
- language independent
- artificial intelligence