News Across Languages - Cross-Lingual Document Similarity and Event Tracking.
Jan RupnikAndrej MuhicGregor LebanPrimoz SkrabaBlaz FortunaMarko GrobelnikPublished in: CoRR (2015)
Keyphrases
- cross lingual
- news articles
- document similarity
- document clustering
- document representation
- text documents
- news stories
- cross lingual information retrieval
- online news
- text classification
- machine translation
- vector space model
- language modeling
- text mining
- semantic similarity
- latent dirichlet allocation
- clustering method
- text categorization
- wordnet
- query expansion
- information retrieval systems
- relevance model
- feature selection