A Hierarchical Text Rating System for Objectionable Documents.
Chi Yoon Jeong, Seung Wan Han, Taek Yong Nam
Published in: J. Inf. Process. Syst. (2005)
Keyphrases
- text content
- text documents
- information retrieval
- free text
- digital documents
- web documents
- textual content
- text analysis
- document analysis
- text collections
- plagiarism detection
- keywords
- text data
- document processing
- text information
- latent semantic analysis
- document content
- text clustering
- newspaper articles
- electronic documents
- text retrieval
- textual information
- text segments
- textual documents
- document collections
- text mining
- printed documents
- multimedia documents
- text corpus
- topic segmentation
- document clustering
- document structure
- linguistic analysis
- automatic categorization
- spoken documents
- topic modeling
- document categorization
- textual data
- document retrieval
- retrieval engine
- handwritten text
- digital libraries
- scientific documents
- text classifiers
- natural language text
- information extraction
- information retrieval systems
- recommender systems
- text corpora
- semantic content
- key concepts
- natural language processing
- page layout
- semantic information
- text categorization
- test collection
- xml documents
- metadata
- sentence level
- web pages
- topic hierarchy
- document repositories
- document set
- text classification
- search engine
- journal articles
- collaborative filtering
- relevant documents
- related documents
- text summarization
- document level