Unsupervised Parallel Corpus Mining on Web Data.
Guokun LaiZihang DaiYiming YangPublished in: CoRR (2020)
Keyphrases
- web data
- web mining
- parallel corpus
- web usage mining
- semi structured
- cross lingual
- web content
- web logs
- web pages
- text mining
- cross language information retrieval
- machine translation system
- link structure
- data mining techniques
- social network analysis
- data mining
- information extraction
- web documents
- digital libraries
- knowledge discovery
- language independent
- artificial intelligence
- unsupervised manner
- data analysis
- databases