A distributed Arabic text classification approach using latent semantic analysis for big data.
Hadeel AlazzamAbdulsalam AlsmadyPublished in: CSIT (1) (2017)
Keyphrases
- big data
- text classification
- data intensive
- commodity hardware
- cloud computing
- big data analytics
- high volume
- data management
- data analysis
- text categorization
- feature selection
- vast amounts of data
- knowledge discovery
- data processing
- n gram
- bag of words
- distributed systems
- data science
- business intelligence
- unstructured data
- machine learning
- text mining
- social media
- data intensive computing
- text data
- massive data
- data warehousing
- distributed environment
- text documents
- information processing
- software engineering
- social computing
- data analytics
- query processing
- case study
- real world
- data driven decision making