French and German Corpora for Audience-based Text Type Classification.
Amalia TodirascuSebastian PadóJennifer KrischMax KisselewUlrich HeidPublished in: LREC (2012)
Keyphrases
- supervised machine learning
- text data
- classification accuracy
- pattern recognition
- machine learning
- text classification
- pattern classification
- classification scheme
- automatic classification
- image classification
- support vector machine
- support vector
- database
- text corpora
- decision rules
- support vector machine svm
- linguistic patterns
- text mining
- natural language processing
- supervised learning
- feature vectors
- keywords
- decision trees
- data mining
- classification algorithm
- text documents
- information extraction
- document classification
- active learning
- feature space
- text corpus
- topic segmentation