Mining the Spoken Wikipedia for Speech Data and Beyond.
Arne Köhn, Florian Stegen, Timo Baumann
Published in: LREC (2016)
Keyphrases
- database
- data sets
- synthetic data
- knowledge discovery
- databases
- data mining techniques
- high quality
- original data
- data mining algorithms
- data collection
- data processing
- data mining methods
- data analysis
- data mining applications
- data mining
- web mining
- web data
- transactional data
- hidden knowledge
- text mining
- image data
- data sources
- training data
- knowledge base
- search engine
- information retrieval