The Automatic Extraction of Web Information Based on Regular Expression.
Ji Li, Guangyu Jiang, Aijun Xu, Yunzhen Wang
Published in: J. Softw. (2017)
Keyphrases
- automatic extraction
- web information
- regular expressions
- pattern matching
- web data
- web mining
- website
- information filtering
- finite automata
- web content
- search engine
- web pages
- query language
- semistructured data
- xml schema
- deterministic finite automata
- deep web
- database
- query evaluation
- information sources
- matching algorithm
- domain specific
- data mining
- databases
- data sets