Multinomial Mixture Modelling for Bilingual Text Classification.
Jorge CiveraAlfons JuanPublished in: PRIS (2006)
Keyphrases
- text classification
- cross lingual
- text categorization
- naive bayes
- text mining
- bag of words
- feature selection
- machine learning
- text documents
- logit model
- n gram
- text data
- text classifiers
- sentiment analysis
- document classification
- multi label
- knn
- cross language
- parallel corpora
- machine translation
- language modeling
- expectation maximization
- mixture model
- semantic features
- labeled data
- word alignment
- chinese english
- information theoretic
- cross language information retrieval
- term frequency
- data cleaning
- probabilistic model
- information retrieval