Aligning Words in French-English Non-Parallel Medical Texts: Effect of Term Frequency Distributions.
Yun-Chuang ChiaoPierre ZweigenbaumPublished in: MedInfo (2004)
Keyphrases
- term frequency
- text documents
- english words
- document frequency
- text categorization
- tf idf
- text classification
- document representation
- text mining
- query words
- retrieval model
- bag of words
- natural language
- keywords
- information extraction
- term weighting
- wordnet
- n gram
- average precision
- document clustering
- topic models
- out of vocabulary
- machine translation
- cross language
- information gain
- vector space model
- cross lingual
- retrieval effectiveness
- information retrieval
- web documents
- semi supervised learning
- probabilistic model
- feature extraction
- feature selection