GutenTag: an NLP-driven Tool for Digital Humanities Research in the Project Gutenberg Corpus.
Julian BrookeAdam HammondGraeme HirstPublished in: CLfL@NAACL-HLT (2015)
Keyphrases
- natural language processing
- information extraction
- coreference resolution
- natural language
- digital museum
- digital libraries
- free text
- data driven
- machine learning
- question answering
- software projects
- project management
- machine translation
- software development
- software engineering
- test set
- data collection
- text mining
- social sciences
- text analysis
- hand crafted
- digital resources
- broad coverage
- case study