Identifying Historical Travelogues in Large Text Corpora Using Machine Learning.
Jan RördenDoris GruberMartin KricklBernhard HaslhoferPublished in: CoRR (2020)
Keyphrases
- text corpora
- machine learning
- text mining
- computational linguistics
- natural language processing
- text documents
- text classification
- topic models
- text classifiers
- information extraction
- document collections
- text analysis
- artificial intelligence
- data mining
- knowledge discovery
- learning algorithm
- topic modeling
- text collections
- feature space
- bag of words
- natural language
- document classification
- concept hierarchy
- data structure
- feature selection
- information retrieval