Reshaping text data for efficient processing on Amazon EC2.
Gabriela TurcuIan T. FosterSvetlozar NestorovPublished in: HPDC (2010)
Keyphrases
- efficient processing
- text data
- text classification
- text mining
- query processing
- range queries
- high dimensional
- text documents
- structured data
- efficient implementation
- high dimensional data
- document collections
- multi dimensional
- web pages
- join algorithms
- databases
- database
- knn
- wordnet
- data sets
- similarity search
- information retrieval systems
- machine learning