Extracting Structural Features Among Words from Document Data Streams.
Kumiko Ishida, Tomoyuki Uchida, Kayo Kawamoto
Published in: Australian Conference on Artificial Intelligence (2006)
Keyphrases
- structural features
- data streams
- text documents
- text lines
- keywords
- structural information
- data sets
- linguistic features
- semantic features
- feature set
- document images
- document collections
- noun phrases
- secondary structure
- information retrieval systems
- information retrieval
- semantic information
- n-gram
- document clustering
- sentiment analysis
- word sense disambiguation
- web documents
- clustering method
- text classification
- text mining
- feature selection
- textural information