Chinese and Vietnamese cross-lingual topic discovery based on word similarity of comparable corpus.
Zhengtao YuLinjie XiaPeili TangXiaocong WangShengxiang GaoPublished in: Int. J. Inf. Commun. Technol. (2024)
Keyphrases
- mono lingual
- cross lingual
- topic discovery
- text classification
- text analysis
- topic models
- machine translation
- language modeling
- latent dirichlet allocation
- bag of words
- text categorization
- statistical machine translation
- text mining
- document representation
- question answering
- text documents
- text retrieval
- document clustering
- information retrieval
- n gram
- feature selection
- artificial intelligence
- named entities
- text data
- wordnet
- high dimensional