Data Extraction via Semantic Regular Expression Synthesis.
Qiaochu ChenArko BanerjeeÇagatay DemiralpGreg DurrettIsil DilligPublished in: CoRR (2023)
Keyphrases
- data extraction
- regular expressions
- semi structured
- pattern matching
- web data extraction
- semistructured data
- semistructured databases
- data integration
- query language
- web pages
- information extraction
- semantic information
- information integration
- xml schema
- query interface
- domain specific
- semantic web
- databases
- approximate matching
- database
- deterministic finite automata
- end users
- decision making
- website
- data exchange
- association rules
- data model
- matching algorithm
- domain knowledge
- web documents
- data sources