EDocDeDup: Electronic Document Data Deduplication Towards Storage Optimization.
Me Me KhaingN. JeyanthiPublished in: Int. J. Perform. Eng. (2023)
Keyphrases
- data sets
- database
- data quality
- training data
- image data
- evolutionary algorithm
- efficient storage
- missing data
- data analysis
- optimization problems
- data processing
- web documents
- synthetic data
- data structure
- raw data
- textual content
- stored data
- information retrieval systems
- data mining techniques
- information retrieval
- record linkage
- data mining