Semi-automatic Document Classification: Exploiting Document Difficulty.
Miguel Martinez-AlvarezSirvan YahyaeiThomas RoellekePublished in: ECIR (2012)
Keyphrases
- document classification
- semi automatic
- fully automatic
- text categorization
- web documents
- topic extraction
- text classification
- text documents
- semi automatically
- gold standard
- classification algorithm
- text mining
- automatic document classification
- ontology mapping
- wrapper generation
- knn
- semi supervised
- high dimensional
- landmark extraction
- data analysis