Exploiting Transliterated Words for Finding Similarity in Inter-Language News Articles using Machine Learning.
Sameea NaeemArif Ur RahmanSyed Mujtaba HaiderAbdul Basit MughalPublished in: CoRR (2022)
Keyphrases
- named entities
- text documents
- news articles
- text mining
- information extraction
- machine learning
- natural language processing
- comparable corpora
- natural language
- related words
- keyphrases
- unsupervised learning
- text corpus
- newspaper articles
- online news
- news sites
- text corpora
- similarity measure
- statistical topic models
- language specific
- text classification
- bilingual dictionaries
- cross lingual
- news events
- semi supervised learning
- artificial intelligence
- target language
- news stories
- computer vision
- topic models
- data mining
- news items
- word pairs
- learning algorithm
- semantic similarity
- feature selection
- data analysis