Extracting Content from Web Pages Using the Sliding Window.
Liu Yang, Chunping Li, Ming Gu
Published in: CSA (2009)
Keyphrases
- sliding window
- web pages
- data streams
- web content
- web documents
- textual content
- fixed size
- website
- dynamic content
- variable size
- window size
- data extraction
- web resources
- stream data
- limited memory
- search engine
- browsing experience
- streaming data
- web search
- data records
- dynamically generated
- link analysis
- continuous queries
- web portals
- content features
- data mining
- window sizes
- pattern matching
- space efficient
- web mining
- data structure
- html pages
- semi structured
- web data