Experiments on Sentence Boundary Detection in User-Generated Web Content.
Roque Enrique López Condori, Thiago Alexandre Salgueiro Pardo
Published in: CICLing (1) (2015)
Keyphrases
- user generated
- boundary detection
- web content
- website
- natural language
- image segmentation
- web pages
- web documents
- web data
- text content
- user interests
- detection algorithm
- berkeley segmentation dataset
- social media
- semantic browsing
- web resources
- sentence level
- product reviews
- data mining
- information extraction
- text corpus