CATI : une approche interactive de découverte et de classification de grands corpus de documents.
Cédric BoscherElöd Egyed-ZsigmondSylvie CalabrettoPublished in: EGC (2022)
Keyphrases
- document classification
- classification accuracy
- feature extraction
- automatic classification
- information retrieval
- pre classified
- supervised machine learning
- support vector machine svm
- supervised learning
- classification algorithm
- support vector
- automatic categorization
- support vector machine
- document collections
- image classification
- machine learning
- text classification
- text documents
- document retrieval
- information retrieval systems
- text data
- person names
- feature vectors
- textual features
- newspaper articles
- feature selection
- text classifiers
- parallel corpora
- training corpus
- multiword
- keywords
- document clustering
- web documents