Scholarly Document Information Extraction using Extensible Features for Efficient Higher Order Semi-CRFs.
Nguyen Viet CuongMuthu Kumar ChandrasekaranMin-Yen KanWee Sun LeePublished in: JCDL (2015)
Keyphrases
- higher order
- information extraction
- conditional random fields
- named entity recognition
- information retrieval
- text documents
- natural images
- web documents
- contextual features
- sequence labeling
- markov random field
- precision and recall
- high order
- free text
- feature vectors
- semantic information
- digital libraries
- feature extraction
- question answering
- text mining
- object oriented
- pairwise
- relation extraction
- crf model
- feature space
- document collections
- named entities
- image features
- classification accuracy
- probabilistic model
- low level
- tf idf
- text summarization
- natural language