A regular expression generator based on CSS selectors for efficient extraction from HTML pages.

Erdinç Uzun
Published in: Turkish J. Electr. Eng. Comput. Sci. (2020)
Keyphrases
  • regular expressions
  • pattern matching
  • HTML pages
  • machine learning
  • approximate matching
  • information retrieval
  • web pages
  • data analysis
  • low-level
  • semi-structured
  • XML schema
  • automatic extraction
  • data extraction