Correspondence Factor Analysis of Big Data Sets: A Case Study of 30 Million Words; and Contrasting Analytics using Apache Solr and Correspondence Analysis in R.
Fionn Murtagh
Published in: CoRR (2015)
Keyphrases
- correspondence analysis
- factor analysis
- open source
- data sets
- big data
- latent factors
- independent component analysis
- component analysis
- discriminant analysis
- cluster analysis
- real world
- matrix factorization
- text documents
- data mining
- statistical tests
- data analysis
- statistical analysis
- semi supervised
- feature vectors
- document representation
- neural network
- databases