Keyphrases
- database
- textual data
- information retrieval
- text mining
- text data
- data sets
- free text
- image data
- data quality
- data analysis
- high quality
- text documents
- databases
- data collection
- unstructured text
- text retrieval
- web documents
- training examples
- data processing
- supervised learning
- data sources
- keywords
- statistical analysis
- text categorization
- information retrieval systems
- missing data
- information extraction
- knowledge discovery
- probability distribution
- prior knowledge
- text summarization
- textual content