Comparison of String Distance Metrics for Lemmatisation of Named Entities in Polish.
Jakub PiskorskiMarcin SydowKarol WielochPublished in: LTC (2007)
Keyphrases
- association rules
- named entities
- distance metric
- named entity recognition
- named entity extraction
- co occurrence
- news corpus
- euclidean distance
- distance metric learning
- relation extraction
- information extraction
- question answering
- text mining
- natural language processing
- distance measure
- distance function
- metric learning
- text documents
- annotated corpus
- person names
- unsupervised learning
- named entity disambiguation
- data points
- news articles
- information retrieval systems
- data analysis
- text corpus
- natural language
- clustering algorithm
- artificial intelligence
- neural network
- databases
- data sets