A Robust Two Level Classification Algorithm for Text Localization in Documents.
R. KandanNirup Kumar ReddyK. R. ArvindA. G. RamakrishnanPublished in: ISVC (2) (2007)
Keyphrases
- classification algorithm
- document classification
- text documents
- information retrieval
- web documents
- free text
- knn
- k nearest neighbor
- text retrieval
- digital documents
- training set
- keywords
- textual content
- training phase
- learning algorithm
- support vector machine
- naive bayes
- document content
- plagiarism detection
- accurate classification
- latent semantic analysis
- document clustering
- document analysis
- xml documents
- text mining
- class labels
- classification rules
- database
- document collections
- multimedia documents
- text collections
- classification method
- neural network
- text classification
- concept drift
- text categorization
- electronic documents
- relevant documents
- training data
- semantic information
- image processing
- clustering algorithm