Digital Humanities in Cultural Areas Using Texts That Lack Word Spacing.
Kiyonori Nagasaki, Toru Tomabechi, A. Charles Muller, Masahiro Shimoda
Published in: DH (2016)
Keyphrases
- digital archiving
- social sciences
- English words
- natural language text
- digital resources
- digital archives
- co-occurrence
- syntactic analysis
- linguistic information
- training corpus
- digital libraries
- n-gram
- text segments
- text corpus
- punctuation marks
- text input
- cross-cultural
- word sense
- keywords
- word recognition
- world knowledge
- part of speech
- newspaper articles
- word sense disambiguation
- linked data
- ambiguous words
- text mining
- metadata