A Low-Storage-Consumption XML Labeling Method for Efficient Structural Information Extraction.
Wenxin LiangAkihiro TakahashiHaruo YokotaPublished in: DEXA (2009)
Keyphrases
- information extraction
- detection method
- significant improvement
- clustering method
- segmentation method
- semi structured
- cost function
- dynamic programming
- computational cost
- high precision
- high accuracy
- unsupervised learning
- computationally efficient
- precision and recall
- databases
- data exchange
- support vector machine
- preprocessing
- image segmentation
- information retrieval