Consolidating client names in the lobbying disclosure database using efficient clustering techniques.
Rajan Kumar Kharel, Niju Shrestha, Chengcui Zhang, Grant T. Savage, Ariel D. Smith
Published in: ACM Southeast Regional Conference (2014)
Keyphrases
- database
- clustering algorithm
- databases
- clustering method
- k means
- data sets
- statistical databases
- database systems
- unsupervised learning
- database applications
- self organizing maps
- hierarchical clustering
- data model
- object oriented
- data management
- keywords
- named entities
- cluster analysis
- information loss
- categorical data