Enabling information extraction by inference of regular expressions from sample entities.
Falk BrauerRobert RiegerAdrian MocanWojciech M. BarczynskiPublished in: CIKM (2011)
Keyphrases
- regular expressions
- information extraction
- named entities
- pattern matching
- xml schema
- entity resolution
- finite automata
- query language
- relational learning
- natural language processing
- semistructured data
- regular languages
- text mining
- information retrieval
- relation extraction
- semi structured
- deterministic finite automata
- tree automata
- regular path queries
- context free grammars
- sample size
- query evaluation
- matching algorithm
- structured data
- co occurrence
- natural language
- database systems
- efficient learning
- grammatical inference
- web documents