An Approximately Duplicate Records Detection Method for Electric Power Big Data Based on Spark and IPOP-Simhash.
Ren-Jie SongTong YuYu-Hong ChenYu-Yang ChenBin XiaPublished in: J. Inf. Hiding Multim. Signal Process. (2018)
Keyphrases
- detection method
- big data
- electric power
- detection algorithm
- cloud computing
- power plant
- data processing
- data management
- data analysis
- social media
- face detection
- big data analytics
- unstructured data
- vast amounts of data
- business intelligence
- databases
- data science
- data warehousing
- knowledge discovery
- record linkage
- electricity markets
- data cleaning
- database
- region detection
- data warehouse
- fuzzy logic
- information systems
- data sets
- support vector data description