A language independent approach to multilingual document representation including Arabic.
Souhila BouchamHassina AlianePublished in: AICCSA (2021)
Keyphrases
- language independent
- document representation
- cross lingual
- bag of words
- n gram
- document collections
- cross language
- text classification
- language model
- document clustering
- vector space model
- machine translation
- text documents
- language specific
- web documents
- text retrieval
- vector space
- data fusion
- language modeling
- semantic information
- text data
- word segmentation
- information retrieval
- information retrieval systems
- feature selection
- background knowledge
- document retrieval
- text mining
- data mining
- question answering
- clustering method
- image classification
- probabilistic model
- metadata