A Novel Web Scraping Approach Using the Additional Information Obtained From Web Pages.
Erdinç UzunPublished in: IEEE Access (2020)
Keyphrases
- web pages
- website
- web users
- web content
- web documents
- web data
- search engine
- web search
- data extraction
- dynamically generated
- web search engines
- link analysis
- web resources
- web information extraction
- classifying web pages
- google search engine
- keywords
- web browsing
- dynamic content
- web applications
- web graph
- deep web
- web sources
- web communities
- social bookmarking
- web browser
- web portals
- hyperlink structure
- web spam
- user access patterns
- page content
- information sources
- semantic web
- web mining
- web logs
- web server