Statistically Valid Links and Anti-links BetweenWords and Between Documents: Applying TourneBool Randomization Test to a Reuters Collection.
Alain LeluMartine CadotPublished in: EGC (best of volume) (2009)
Keyphrases
- document collections
- text documents
- database
- text categorization
- automatic categorization
- link analysis
- text data
- text classification
- free text
- information retrieval
- document clustering
- user queries
- web mining
- web documents
- k nearest neighbor
- document classification
- text mining
- information extraction
- document analysis
- knn