Efficient Search in Hidden Text of Large DjVu Documents.
Janusz S. Bien
Published in: NLP4DL/AT4DL (2009)
Keyphrases
- efficient search
- text documents
- digital documents
- information retrieval
- web documents
- free text
- document analysis
- keywords
- text mining
- text retrieval
- text collections
- textual content
- document content
- latent semantic analysis
- similarity search
- multimedia documents
- search problems
- document retrieval
- document collections
- electronic documents
- information extraction
- relevant documents
- information retrieval systems
- semantic information
- data sets
- related documents
- text lines
- document clustering
- database
- user queries
- query expansion
- text classification
- vector space
- document images
- co-occurrence
- query terms
- XML documents
- pattern recognition
- search algorithm
- neural network