Latent Dirichlet Allocation Based Semantic Clustering of Heterogeneous Deep Web Sources.
Umara Noor, Ali Daud, Ayesha Manzoor
Published in: INCoS (2013)
Keyphrases
- latent dirichlet allocation
- web sources
- topic models
- topic discovery
- probabilistic latent semantic analysis
- topic modeling
- information integration
- generative model
- lda model
- data extraction
- text mining
- information sources
- web data
- clustering algorithm
- semi-structured
- multiple sources
- data sources
- co-occurrence
- latent semantic analysis
- website
- probabilistic model
- data sets
- k-means
- deep web
- semantic information
- web mining
- text documents
- data analysis
- semantic features
- high dimensional data
- dimensionality reduction
- data points
- search engine
- machine learning