Using Topic Modelling to Explore Authors' Research Fields in a Corpus of Historical Scientific English.
Stefan Fischer, Jörg Knappen, Elke Teich
Published in: DH (2018)
Keyphrases
- scientific papers
- link grammar
- writing style
- statistical machine translation
- person names
- broad coverage
- open domain
- parallel corpus
- natural language
- wide coverage
- training corpus
- text corpora
- machine translation
- topic models
- citation networks
- unknown words
- topic segmentation
- historical data
- artificial intelligence
- english language
- scientific data
- topic detection and tracking
- monolingual
- penn treebank
- conversational speech
- english words
- word level
- semantic roles
- chinese english
- comparable corpora
- topic tracking
- science education
- document corpus
- document level
- broadcast news
- scientific literature
- machine translation system