Information Extraction from Semi-structured Resources: A Two-Phase Finite State Transducers Approach.
Vesna Pajic, Gordana Pavlovic-Lazetic, Milos Pajic
Published in: CIAA (2011)
Keyphrases
- semi structured
- information extraction
- structured knowledge
- finite state transducers
- structured data
- machine translation
- web documents
- data extraction
- free text
- text mining
- natural language processing
- semi structured data
- wrapper generation
- data model
- information retrieval
- machine learning
- web data sources
- finite state
- unstructured data
- web mining
- textual data
- language modeling
- vector space
- database
- markov chain
- active learning
- data sets