Wikipedia-based cross-language text classification.
Marcos Mouriño-GarcíaRoberto Pérez-RodríguezLuis E. Anido-RifónPublished in: Inf. Sci. (2017)
Keyphrases
- text classification
- cross language
- text categorization
- cross lingual
- explicit semantic analysis
- document collections
- bag of words
- text retrieval
- question answering
- document retrieval
- named entities
- multi label
- semantic features
- feature selection
- cross language information retrieval
- information access
- genre classification
- text mining
- text documents
- language modeling
- query translation
- spoken document retrieval
- knowledge base
- labeled data
- knn
- machine learning
- wordnet
- information retrieval systems
- information extraction
- test collection
- unlabeled data
- unsupervised learning
- wikipedia articles
- k nearest neighbor
- textual and visual information
- semantic relations
- information retrieval
- link structure
- document clustering
- semantic information
- knowledge discovery
- multimedia