Amilcare: adaptive information extraction for document annotation.
Fabio CiravegnaAlexiei DingliYorick WilksDaniela PetrelliPublished in: SIGIR (2002)
Keyphrases
- information extraction
- text documents
- web documents
- information retrieval
- natural language processing
- unstructured documents
- precision and recall
- text summarization
- text mining
- information retrieval systems
- document classification
- free text
- structured data
- retrieval systems
- document clustering
- machine learning
- semantic annotation
- social annotations
- named entity recognition
- semi structured
- metadata
- active learning
- document images
- question answering
- document collections
- web mining
- vector space model
- language model
- document similarity
- text corpus
- automatic image annotation
- relation extraction
- text categorization
- tf idf
- image annotation
- relevant documents
- knowledge discovery