Information Extraction from Presentation-Oriented Documents.
Massimo Ruffolo, Ermelinda Oro
Published in: ERCIM News (2012)
Keyphrases
- information extraction
- text documents
- free text
- web documents
- unstructured documents
- information retrieval
- text analysis
- natural language text
- document collections
- natural language processing
- text mining
- information retrieval systems
- metadata
- document clustering
- textual data
- named entities
- question answering
- unstructured text
- electronic documents
- multimedia
- relation extraction
- document classification
- xml documents
- keywords
- legal documents
- information extraction systems
- semi-structured
- precision and recall
- web mining
- conditional random fields
- document representation
- document retrieval
- relevant documents
- machine learning
- vector space
- vector space model
- relational learning
- machine translation
- text corpora
- retrieval systems
- digital documents
- semantic information
- structured data