Classifying Short Unstructured Data Using the Apache Spark Platform.
Eduardo P. S. Castro, Saurabh Chakravarty, Eric Williamson, Denilson Alves Pereira, Edward A. Fox
Published in: JCDL (2017)
Keyphrases
- unstructured data
- structured data
- textual data
- big data
- semi-structured
- relational databases
- open source
- structured and unstructured data
- raw data
- information management
- data warehouse
- semi-structured data
- information retrieval
- information extraction
- data sources
- metadata
- case study
- database
- machine learning
- artificial intelligence
- data integration
- business intelligence
- information processing
- object-oriented
- knowledge discovery