SMILK, linking natural language and data from the web.
Cédric LopezMolka DhouibElena CabrioCatherine Faron-ZuckerFabien GandonFrédérique SegondPublished in: CoRR (2019)
Keyphrases
- data sets
- synthetic data
- database
- data sources
- website
- data structure
- web data
- missing data
- information sources
- data collection
- data distribution
- machine learning
- natural language
- data analysis
- experimental data
- log files
- original data
- data objects
- data quality
- log data
- web documents
- data processing
- input data
- data mining techniques
- natural language processing
- knowledge discovery
- knowledge representation
- high quality
- web pages