Instance Pruning by Filtering Uninformative Words: An Information Extraction Case Study.
Alfio Massimiliano GliozzoClaudio GiulianoRaffaella RinaldiPublished in: CICLing (2005)
Keyphrases
- information extraction
- case study
- word sense disambiguation
- text documents
- text mining
- natural language processing
- named entity recognition
- natural language text
- pruning method
- filtering algorithm
- precision and recall
- keywords
- information retrieval
- web documents
- textual data
- lessons learned
- semi structured
- pruning algorithms
- text processing
- web mining
- n gram
- adaptive filtering
- relation extraction
- text corpora
- word segmentation
- pruning algorithm
- text summarization
- development process
- free text
- machine learning
- information filtering
- named entities
- conditional random fields
- structured data
- wordnet
- search algorithm
- english words
- search space
- related words
- ontology based information extraction
- open domain
- natural language
- hidden markov models
- knowledge discovery
- data extraction