Cleaning Noisy and Heterogeneous Metadata for Record Linking Across Scholarly Big Datasets.
Athar SefidJian WuAllen C. GeJing ZhaoLu LiuCornelia CarageaPrasenjit MitraC. Lee GilesPublished in: CoRR (2019)
Keyphrases
- metadata
- digital libraries
- heterogeneous data
- database
- heterogeneous databases
- semantically rich
- databases
- metadata extraction
- benchmark datasets
- uci machine learning repository
- heterogeneous sources
- semantic information
- noisy data
- multimedia
- social networks
- metadata management
- data management
- structured data
- learning resources
- information resources
- search tools
- learning objects
- data mining