Information extraction from semi-structured and un-structured documents using probabilistic context free grammar inference.
Ramesh Thakur, Suresh Jain, Narendra S. Chaudhari, Rahul Singhai
Published in: CAMP (2012)
Keyphrases
- semi structured
- structured documents
- information extraction
- web documents
- information retrieval
- structured data
- xml documents
- data extraction
- information retrieval systems
- probabilistic context free grammars
- natural language processing
- text mining
- data model
- query language
- named entity recognition
- relevant documents
- text documents
- machine translation
- bayesian networks
- relation extraction
- natural language
- named entities
- machine learning
- document representation
- question answering
- keywords
- finite state
- natural language text