An effective method for finding best entry points in semi-structured documents.
Eugen Popovici, Pierre-Francois Marteau, Gildas Ménier
Published in: SIGIR (2007)
Keyphrases
- high accuracy
- detection method
- significant improvement
- cost function
- classification method
- dynamic programming
- experimental evaluation
- segmentation method
- entry points
- similarity measure
- high precision
- synthetic data
- clustering method
- computationally efficient
- preprocessing
- segmentation algorithm
- computational cost
- pairwise
- objective function