DomainNet: Homograph Detection for Data Lake Disambiguation.
Aristotelis LeventidisLaura Di RoccoWolfgang GatterbauerRenée J. MillerMirek RiedewaldPublished in: CoRR (2021)
Keyphrases
- data sets
- synthetic data
- raw data
- data objects
- data structure
- data analysis
- prior knowledge
- data sources
- data processing
- data acquisition
- high dimensional data
- input data
- data mining techniques
- data management
- small number
- original data
- experimental data
- data quality
- statistical analysis
- data collection
- natural language processing
- knowledge discovery
- training data
- search engine
- information retrieval