BigBatch: a document processing platform for clusters and grids.
Giorgia de Oliveira Mattos
Rafael Dueire Lins
Andrei de Araújo Formiga
Fernando Mário Junqueira Martins
Published in:
SAC (2008)
Keyphrases
document processing
document clustering
production line
digital libraries
document images
information retrieval
clustering algorithm
information extraction
cluster analysis
document analysis
text mining
text processing
textual documents
dynamic programming
databases
pattern recognition
knowledge extraction