Focused Web Corpus Crawling.
Roland SchäferAdrien BarbaresiFelix BildhauerPublished in: WaC@EACL (2014)
Keyphrases
- focused crawler
- web pages
- website
- web mining
- web crawling
- search engine
- web crawlers
- focused crawling
- web crawler
- semantic web
- web applications
- web information retrieval
- web graph
- web documents
- web resources
- link analysis
- database
- web data
- web content
- social media
- newspaper articles
- web search
- linked data
- web information
- end users
- meta search
- plain text
- information sources
- textual features