Efficient Removal of Noisy Borders of Monochromatic Documents.
Andrei de Araújo Formiga, Rafael Dueire Lins
Published in: ICIAR (2009)
Keyphrases
- information retrieval
- xml documents
- cost effective
- document collections
- legal documents
- information retrieval systems
- web documents
- database
- text documents
- document clustering
- bag of words
- document analysis
- noisy environments
- multi document summarization
- vector space model
- free text
- computationally expensive
- information extraction
- keywords
- multimedia
- neural network
- data sets