NoDoSE - A Tool for Semi-Automatically Extracting Semi-Structured Data from Text Documents.
Brad AdelbergPublished in: SIGMOD Conference (1998)
Keyphrases
- text documents
- automatically extracting
- semi structured data
- text mining
- structured data
- information extraction
- semi structured
- text classification
- web mining
- topic models
- text categorization
- text data
- bag of words
- keywords
- document representation
- document clustering
- xml data
- named entities
- wordnet
- xml documents
- semi supervised learning
- web data
- data sets
- question answering
- multiscale
- anchor text
- machine learning