Web information mining and semantic analysis in heterogeneous unstructured text data using enhanced latent Dirichlet allocation.
Madamanchi VenugopalVirendra K. SharmaKalpana SharmaPublished in: Concurr. Comput. Pract. Exp. (2023)
Keyphrases
- text mining
- semantic analysis
- text data
- latent dirichlet allocation
- web information
- web mining
- natural language processing
- topic models
- semi structured
- web data
- topic modeling
- text documents
- information extraction
- text classification
- structured data
- natural language
- semantic information
- web pages
- website
- knowledge discovery
- information retrieval
- machine learning
- named entities
- data mining
- text corpora
- search engine
- data sets
- wordnet
- generative model
- artificial intelligence
- web content
- domain knowledge
- high dimensional data
- document collections
- nearest neighbor
- probabilistic model
- relational databases
- data analysis
- pattern recognition
- decision trees