A Language-Independent Approach to Identify the Named Entities in Under-Resourced Languages and Clustering Multilingual Documents.
N. Kiran KumarG. S. K. SantoshVasudeva VarmaPublished in: CLEF (2011)
Keyphrases
- language independent
- named entities
- multilingual documents
- machine translation
- information extraction
- natural language processing
- named entity recognition
- unsupervised learning
- cross lingual
- text mining
- n gram
- clustering algorithm
- target language
- co occurrence
- question answering
- text classification
- text retrieval
- cross language
- text documents
- cross language information retrieval
- word level
- automatic summarization
- natural language
- wordnet
- data points
- k means
- training data
- word segmentation
- machine learning
- information retrieval
- text summarization
- dimensionality reduction
- databases