Information Extraction in Structured Documents Using Tree Automata Induction.
Raymond KosalaJan Van den BusscheMaurice BruynoogheHendrik BlockeelPublished in: PKDD (2002)
Keyphrases
- structured documents
- tree automata
- information extraction
- web documents
- information retrieval
- regular expressions
- finite automata
- structured document retrieval
- query language
- machine learning
- finite state
- text mining
- information retrieval systems
- natural language processing
- xml documents
- question answering
- named entity recognition
- relevant documents
- document representation
- natural language text
- relation extraction
- named entities
- structured data
- machine translation
- query evaluation
- context free grammars
- databases
- hidden markov models
- natural language
- data mining