Modified Self-organizing Maps for Line Extraction in Digitized Text Documents.
Juan Manuel Alonso-Weber, Inés María Galván, Araceli Sanchis de Miguel
Published in: Applied Informatics (2003)
Keyphrases
- self organizing maps
- text documents
- line extraction
- text mining
- text classification
- text categorization
- neural network
- wordnet
- hough transform
- information extraction
- topic models
- keywords
- text analysis
- unsupervised learning
- morphological operations
- document images
- input data
- named entities
- bag of words
- machine learning
- k means
- training data
- input image
- multiscale
- knowledge base
- feature selection