Historical corpora meet the digital humanities: the Jerusalem Corpus of Emergent Modern Hebrew.
Aynat RubinsteinPublished in: Lang. Resour. Evaluation (2019)
Keyphrases
- text corpora
- computational linguistics
- document corpus
- parallel corpus
- digital resources
- annotated corpus
- text data
- statistical machine translation
- natural language processing
- topic segmentation
- historical data
- text corpus
- wide coverage
- digital libraries
- training corpus
- social sciences
- text collections
- linguistic patterns
- specific domains
- information seeking
- sentence pairs
- manually annotated
- word frequency
- multiword
- machine learning
- test set
- text classification
- text mining
- metadata
- information retrieval