Local and global topics in text modeling of web pages nested in web sites.
Jason Wang, Robert E. Weiss
Published in: Comput. Stat. Data Anal. (2022)
Keyphrases
- web pages
- website
- keywords
- technical papers
- text content
- web documents
- text data
- information retrieval
- search engine
- textual content
- html pages
- web browser
- web content
- web server
- key concepts
- scientific papers
- hierarchical structure
- link analysis
- web users
- content features
- text documents
- hyperlink structure
- news stories
- dynamically generated
- related web pages
- data extraction
- plain text
- topic detection
- topic specific
- web logs
- web search
- web graph
- text collections
- probabilistic topic models
- log files
- free text
- information extraction