Text Classification Based on the Heterogeneous Graph Considering the Relationships between Documents.
Hiromu NakajimaMinoru SasakiPublished in: Big Data Cogn. Comput. (2023)
Keyphrases
- text classification
- text documents
- document classification
- labeled documents
- text classifiers
- training documents
- text data
- term frequency
- semantic relationships
- text categorization
- text mining
- document categorization
- bag of words
- training corpus
- feature selection
- classify documents
- document collections
- semantic features
- naive bayes
- automatic text classification
- sentiment analysis
- graph theory
- directed graph
- n gram
- information retrieval
- document retrieval
- heterogeneous collections
- related documents
- relevant documents
- document clustering
- information retrieval systems
- document representation
- sentiment classification
- distributional clustering
- labeled data
- unlabeled data
- structured data
- bipartite graph
- web documents
- machine learning
- multi label
- semantic information
- random walk
- metadata
- information extraction
- query terms
- search engine
- structural patterns
- keywords
- xml documents
- data cleaning
- multi document summarization
- document set
- graph structure
- language modeling
- weighted graph