LumberChunker: Long-Form Narrative Document Segmentation.
André V. DuarteJoão MarquesMiguel GraçaMiguel FreireLei LiArlindo L. OliveiraPublished in: CoRR (2024)
Keyphrases
- image segmentation
- segmentation method
- information retrieval systems
- segmentation algorithm
- page segmentation
- database
- multiscale
- fully automatic
- information retrieval
- edge detection
- medical images
- image analysis
- level set
- keywords
- document classification
- markov random field
- document collections
- text documents
- region growing
- object segmentation
- structured documents
- segmented images