Smart Qualitative Data (SQUAD): Information Extraction in a Large Document Archive.
Maria MilosavljevicClaire GroverLouise CortiPublished in: RIAO (2007)
Keyphrases
- information extraction
- web documents
- text documents
- information retrieval
- unstructured documents
- text mining
- natural language processing
- document processing
- text summarization
- document clustering
- precision and recall
- document collections
- information retrieval systems
- document images
- document retrieval
- document classification
- machine learning
- retrieval systems
- database
- knowledge discovery
- free text
- tf idf
- cross document
- relation extraction
- structured documents
- smart environments
- natural language text
- open domain
- ontology based information extraction
- web mining
- machine translation
- conditional random fields
- natural language
- document representation
- relational learning
- semi structured
- text processing
- multimedia documents
- data extraction
- document analysis
- structured data
- question answering
- digital documents
- generative model
- multimedia