Web Data Extraction from Scientific Publishers' Website Using Hidden Markov Model.
Jing HuangZiyu LiuBeibei WangMingyue DuanBo YangPublished in: KSEM (1) (2018)
Keyphrases
- hidden markov models
- web data extraction
- website
- web pages
- data extraction
- semi structured
- sequential data
- speech recognition
- gesture recognition
- web content
- markov models
- conditional random fields
- hidden state
- viterbi algorithm
- baum welch
- search engine
- markov model
- web server
- discriminative training
- hidden states
- hierarchical hidden markov model
- web users
- kernel density
- meta search engine
- database
- data integration
- data model
- metadata
- information retrieval
- machine learning