Modern Tools for Old Content - in Search of Named Entities in a Finnish OCRed Historical Newspaper Collection 1771-1910.
Kimmo KettunenEetu MäkeläJuha KuokkalaTeemu RuokolainenJyrki NiemiPublished in: LWDA (2016)
Keyphrases
- named entities
- named entity recognition
- co occurrence
- named entity extraction
- information extraction
- relation extraction
- question answering
- text mining
- metadata
- natural language processing
- unsupervised learning
- data mining
- news corpus
- document collections
- text corpus
- personal names
- feature extraction
- information retrieval
- machine learning