Local and Global Topics in Text Modeling of Web Pages Nested in Web Sites.
Jason WangRobert E. WeissPublished in: CoRR (2021)
Keyphrases
- web pages
- website
- keywords
- technical papers
- web documents
- text content
- text data
- html pages
- search engine
- web content
- information retrieval
- content features
- text mining
- key concepts
- web server
- hierarchical structure
- textual content
- plain text
- web logs
- dynamically generated
- web search engines
- scientific papers
- text corpora
- text collections
- text documents
- link structure
- news topics
- web information extraction
- text fragments
- link analysis
- web data
- free text
- topic modeling
- web users
- web browser
- textual data
- probabilistic topic models
- dynamic content
- related web pages
- topic detection