Domain and Language Independent Feature Extraction for Statistical Text Categorization
Thomas BayerIngrid RenzMichael SteinUlrich KresselPublished in: CoRR (1996)
Keyphrases
- text categorization
- language independent
- text classification
- cross language
- feature selection
- feature extraction
- n gram
- knn
- transfer learning
- machine translation
- k nearest neighbor
- machine learning
- text documents
- text classifiers
- cross lingual
- text retrieval
- semi supervised learning
- dimensionality reduction
- text mining
- information theoretic
- support vector machine
- term frequency
- bag of words
- probabilistic model
- learning process
- tf idf
- neural network
- data sets