Extracting Structured Data from Web Pages with Maximum Entropy Segmental Markov Model.
Susan MengelYaoquin JingPublished in: WISE (2009)
Keyphrases
- structured data
- maximum entropy
- markov model
- markov models
- web pages
- hidden markov models
- structured information
- markov chain
- semi structured
- conditional random fields
- unstructured information
- information extraction
- unstructured data
- data sources
- search engine
- statistical model
- xml documents
- keywords
- markov networks
- web documents
- metadata
- databases
- probabilistic model
- pairwise
- maximum likelihood
- class labels
- prior knowledge
- feature selection
- data mining