English and Chinese bilingual topic aspect classification: Exploring similarity measures, optimal LSA dimensions, and centroid correction of translated training examples.
Yejun Wu, Douglas W. Oard
Published in: ASIST (2013)
Keyphrases
- training examples
- training set
- supervised learning
- training phase
- class labels
- support vector
- training samples
- small number
- chinese english
- similarity measure
- english chinese
- active learning
- training data
- machine translation
- machine translation system
- decision trees
- cross lingual
- learning algorithm
- feature vectors
- sufficient training data
- machine learning
- target language
- cross language
- object categories
- image classification
- unsupervised learning
- prior domain knowledge
- text classification
- query translation
- source language
- feature space
- domain theory
- feature extraction
- labeled data
- negative training examples
- positive examples
- class distribution
- support vector machine
- document collections
- unlabeled data
- statistical machine translation
- cross language information retrieval
- parallel corpora
- sentence pairs
- wordnet
- monolingual retrieval