Automatic web page segmentation and information extraction using conditional random fields.
Yunfei GongQiang LiuPublished in: CSCWD (2012)
Keyphrases
- information extraction
- named entity recognition
- fully automatic
- web pages
- page segmentation
- web documents
- segmentation algorithm
- website
- natural language processing
- statistical classification
- segmentation method
- medical images
- precision and recall
- web page classification
- semi automatic
- machine learning
- text mining
- free text
- image segmentation
- conditional random fields
- region growing
- multiscale
- object segmentation
- level set
- shape prior
- segmentation accuracy
- web information extraction
- segmented images
- data extraction
- information retrieval
- web server
- semi structured
- web mining
- structured data
- natural language
- named entities
- fully unsupervised
- features extraction
- question answering
- edge detection