Semi-supervised Information Extraction from Variable-length Web-page Lists.
Daniel NikovskiAlan EsentherAkihiro BabaPublished in: ICEIS (1) (2009)
Keyphrases
- variable length
- semi supervised
- information extraction
- web documents
- web pages
- named entity recognition
- fixed length
- website
- n gram
- ranking list
- semi supervised learning
- search engine
- pairwise
- natural language processing
- data extraction
- active learning
- labeled data
- extraction rules
- text mining
- information retrieval
- semi structured
- bitstream
- conditional random fields
- machine learning
- supervised learning
- named entities
- link analysis
- statistical dependencies
- convolutional codes
- web mining
- coding scheme
- knowledge discovery
- image segmentation
- computer vision
- run length encoding