A Gazetteer and Georeferencing for Historical English Documents.
Claire GroverRichard TobinPublished in: LaTeCH@EACL (2014)
Keyphrases
- person names
- named entities
- historical documents
- document collections
- information retrieval
- information retrieval systems
- linguistic analysis
- english language
- historical manuscripts
- source language
- document retrieval
- natural language
- user queries
- xml documents
- manually constructed
- named entity recognition
- text documents
- relevant documents
- machine translation
- document clustering
- stop words
- historical data
- web documents
- retrieval systems
- parallel corpora
- arabic language
- keywords
- indian languages
- machine learning
- answer questions
- cross language
- vector space model
- cross lingual
- multiple sources
- query terms
- natural language processing
- information extraction
- target language
- retrieved documents
- metadata
- semantic information