OXPath: A language for scalable data extraction, automation, and crawling on the deep web.
Tim Furche, Georg Gottlob, Giovanni Grasso, Christian Schallhart, Andrew Jon Sellers
Published in: VLDB J. (2013)
Keyphrases
- data extraction
- deep web
- query interface
- web pages
- web databases
- web sources
- web data extraction
- search engine
- data integration
- deep web data sources
- data sources
- web data
- databases
- semi structured
- website
- database
- information integration
- web documents
- query language
- user queries
- web search
- web content
- structured data
- natural language
- database server
- key technologies
- data records
- information extraction
- multimedia databases
- keywords
- natural language processing
- web server
- machine learning
- distributed systems