Clustering for high dimensional categorical data based on text similarity.
G. Surya NarayanaD. VasumathiPublished in: ICCIP (2016)
Keyphrases
- categorical data
- high dimensional
- cluster analysis
- parameter free
- numerical data
- categorical attributes
- binary data
- numeric data
- numerical attributes
- correspondence analysis
- data points
- attribute values
- density based clustering
- high dimensionality
- low dimensional
- similarity measure
- information retrieval
- similarity search
- feature space
- similarity function
- distance function
- distance based outlier detection
- text documents
- database
- clustering method
- data mining techniques
- text mining
- nearest neighbor
- data analysis
- learning algorithm
- data mining