The Organization and Visualization of Document Corpora: A Probabilistic Approach.
Mark A. GirolamiAlexei VinokourovAta KabánPublished in: DEXA Workshop (2000)
Keyphrases
- document corpus
- information retrieval
- data analysis
- bayesian networks
- document images
- document clustering
- natural language processing
- document classification
- document collections
- text collections
- document retrieval
- text corpus
- keywords
- information systems
- data visualization
- web documents
- text documents
- retrieval systems
- word frequency
- probabilistic model
- database
- interactive visualization
- probabilistic retrieval
- structured documents
- vector space model
- self organizing maps
- user queries
- information retrieval systems
- semi supervised
- data mining