A Comparative Study on Data Representation to Categorize Text Documents.
D. A. MeedeniyaAmal Shehan PereraPublished in: SEKE (2008)
Keyphrases
- data representation
- text documents
- text mining
- information extraction
- text classification
- topic models
- text categorization
- bag of words
- keywords
- data representations
- dimensionality reduction
- document clustering
- named entities
- wordnet
- neural network
- xml documents
- automatic text categorization
- metadata
- machine learning
- data sets
- co occurrence
- building blocks
- probabilistic model
- structured data
- image processing
- search engine
- artificial intelligence
- xml schema