Multilingual news clustering: Feature translation vs. identification of cognate named entities.
Soto Montalvo, Raquel Martínez-Unanue, Arantza Casillas, Víctor Fresno
Published in: Pattern Recognit. Lett. (2007)
Keyphrases
- named entities
- unsupervised learning
- named entity recognition
- named entity extraction
- information extraction
- co-occurrence
- text mining
- natural language processing
- relation extraction
- k-means
- clustering algorithm
- question answering
- machine translation
- news corpus
- Chinese named entity recognition
- document clustering
- global context
- real world
- annotated corpus
- noun phrases
- person names
- databases
- data sets
- text documents
- text classification
- graphical models
- feature set
- supervised learning
- pattern recognition
- training data
- image segmentation
- learning algorithm
- personal names