Large-Scale Linguistic Ontology as a Basis for Text Categorization of Legislative Documents.
Natalia V. LoukachevitchBoris V. DobrovPublished in: JURIX (2005)
Keyphrases
- text categorization
- text documents
- automatic text categorization
- automatic categorization
- document classification
- training documents
- document categorization
- text classifiers
- text collections
- text representation
- term frequency
- feature selection
- term selection
- text clustering
- text classification
- knn
- classify documents
- k nearest neighbor
- multi label
- distributional clustering
- information gain
- document clustering
- domain ontology
- term weighting
- document frequency
- knowledge representation
- semantic information
- semi supervised learning
- natural language
- word frequency
- information retrieval
- web documents
- document collections
- machine learning
- text data
- feature space
- unsupervised learning
- data mining
- tf idf