Information Extraction from the Structured Part of Office Documents.
Xiaolong HaoJason Tsong-Li WangPeter A. NgPublished in: Inf. Sci. (1996)
Keyphrases
- information extraction
- free text
- structured data
- text documents
- unstructured text
- web documents
- information retrieval
- unstructured documents
- text mining
- structured and unstructured data
- text analysis
- natural language text
- textual data
- structured information
- natural language processing
- question answering
- information extraction systems
- semi structured
- precision and recall
- xml documents
- metadata
- document retrieval
- digital documents
- information retrieval systems
- machine learning
- document collections
- document clustering
- named entities
- relevant documents
- structured queries
- multimedia documents
- retrieval systems
- electronic documents
- unstructured data
- text processing
- relation extraction
- relational learning
- conditional random fields
- document representation
- vector space model
- document classification
- named entity recognition
- legal documents
- office environment
- semantic information
- digital libraries
- machine translation
- web mining
- textual information
- natural language
- query terms