A feature selection method for document clustering based on part-of-speech and word co-occurrence.
Zitao Liu, Wenchao Yu, Yalan Deng, Yongtao Wang, Zhiqi Bian
Published in: FSKD (2010)
Keyphrases
- part of speech
- word co-occurrence
- co-occurrence
- keyword extraction
- keyphrase extraction
- text documents
- n-gram
- keywords
- noun phrases
- natural language processing
- related words
- keyphrases
- training corpus
- word sense disambiguation
- tf-idf
- text retrieval
- WordNet
- document clustering
- information retrieval
- text summarization
- named entities
- semantic content
- data mining
- language independent
- news articles
- retrieval systems
- question answering
- text mining
- information extraction
- unsupervised methods
- probabilistic model
- artificial intelligence