Document Similarity for Arabic and Cross-Lingual Web Content.
Ali Salhi, Adnan H. Yahya
Published in: ICALP (2017)
Keyphrases
- web content
- cross lingual
- document similarity
- document clustering
- document representation
- website
- language modeling
- web documents
- machine translation
- web pages
- relevance model
- text classification
- clustering method
- vector space model
- document collections
- clustering algorithm
- text mining
- information retrieval
- transfer learning
- language model
- information extraction
- text documents
- semantic similarity
- social media
- news articles