Old Content and Modern Tools - Searching Named Entities in a Finnish OCRed Historical Newspaper Collection 1771-1910.
Kimmo KettunenEetu MäkeläTeemu RuokolainenJuha KuokkalaLaura LöfbergPublished in: Digit. Humanit. Q. (2017)
Keyphrases
- named entities
- pdf files
- named entity extraction
- named entity recognition
- information extraction
- text mining
- co occurrence
- question answering
- natural language processing
- text corpus
- relation extraction
- machine learning
- text documents
- annotated corpus
- unsupervised learning
- person names
- news corpus
- metadata
- web news
- multimedia
- news articles
- document collections
- wordnet