Synopsis Information Extraction in Documents Through Probabilistic Text Classifiers.
Jantima Polpinij, Aditya Ghose
Published in: ICADL (2007)
Keyphrases
- text classifiers
- information extraction
- text classification
- labeled documents
- text categorization
- document classification
- text mining
- text documents
- training documents
- naive bayes
- unlabeled documents
- text data
- bayesian networks
- web documents
- machine learning
- probabilistic model
- natural language processing
- information retrieval
- named entities
- textual data
- knn
- feature selection
- structured data
- automatic text classification
- generative model
- decision trees
- neural network
- bag of words
- semi supervised learning
- uncertain data
- classification accuracy
- data sets
- nearest neighbor
- support vector machine
- semantic features
- digital libraries
- training data