Structuring Domain-Specific Text Archives by Deriving a Probabilistic XML DTD.
Karsten WinklerMyra SpiliopoulouPublished in: PKDD (2002)
Keyphrases
- domain specific
- domain independent
- multimedia
- general purpose
- text retrieval
- information retrieval
- probabilistic model
- historical manuscripts
- digital libraries
- keywords
- text mining
- bayesian networks
- metadata
- textual data
- string matching
- generative model
- natural language text
- text information
- relation extraction
- uncertain data
- free text
- probabilistic logic
- text categorization
- automatically extracted
- cultural heritage
- context sensitive
- text documents
- database
- text classification
- language model
- feature selection
- data sets