Improving the web text content by extracting significant pages into a Web Site.
Sebastián A. Ríos, Juan D. Velásquez, Eduardo S. Vera, Hiroshi Yasuda, Terumasa Aoki
Published in: ISDA (2005)
Keyphrases
- text content
- web pages
- website
- data extraction
- web users
- web documents
- search engine
- web content
- focused crawling
- web search
- keywords
- web server
- web search engines
- link analysis
- web data
- hyperlink structure
- home page
- web logs
- browsing behavior
- web access logs
- web graph
- web usage mining
- topic specific
- user generated
- dynamic content
- link structure
- machine learning