The Web Data Commons Schema.org Table Corpora.
Ralph PeetersAlexander BrinkmannChristian BizerPublished in: WWW (Companion Volume) (2024)
Keyphrases
- web data
- semi structured data
- web mining
- semistructured data
- semi structured
- database
- data model
- web content
- web information
- web pages
- database schema
- databases
- web documents
- html documents
- incremental mining
- web usage mining
- deep web
- web crawling
- web information extraction
- xml schema
- query logs
- natural language processing
- link structure
- xml data
- data mining
- natural language
- search engine