Automatic Regular Expression Generation for Extracting Relevant Image Data From Web Pages Using Genetic Algorithms.
Canan AslanyürekTarik YerlikayaPublished in: IEEE Access (2024)
Keyphrases
- regular expressions
- image data
- web pages
- pattern matching
- finite automata
- data extraction
- website
- search engine
- static analysis
- web documents
- query language
- string matching
- web search engines
- matching algorithm
- integrity constraints
- query evaluation
- keywords
- semistructured databases
- semistructured data
- approximate matching
- semi structured
- link analysis
- cost model
- web content
- database
- information retrieval