Simplified DOM Trees for Transferable Attribute Extraction from the Web.
Yichao Zhou, Ying Sheng, Nguyen Vo, Nick Edmonds, Sandeep Tata
Published in: CoRR (2021)
Keyphrases
- website
- web pages
- data extraction
- web information extraction
- document object model
- tree structure
- web applications
- web content
- information sources
- semantic web
- decision trees
- information extraction
- web users
- attribute values
- web mining
- linked data
- data structure
- automatic extraction
- information overload
- database systems
- web scale
- dynamic content
- gain ratio
- data mining