Dirichlet-Hawkes Processes with Applications to Clustering Continuous-Time Document Streams.
Nan DuMehrdad FarajtabarAmr AhmedAlexander J. SmolaLe SongPublished in: KDD (2015)
Keyphrases
- document clustering
- clustering algorithm
- real time
- stochastic processes
- cluster analysis
- document images
- cluster membership
- topic discovery
- text clustering
- tolerance rough set
- information retrieval
- data streams
- document classification
- state space
- markov chain
- k means
- clustering analysis
- categorical data
- text documents
- clustering method
- unsupervised learning
- multiple data streams
- document retrieval
- self organizing maps
- hierarchical clustering
- boundary conditions
- information retrieval systems
- text mining
- reinforcement learning
- em algorithm
- cosine similarity
- query processing
- document collections
- high dimensional data
- spectral clustering
- data points
- process model
- retrieval systems