Analyzing mixed-type data by using word embedding for handling categorical features.
Chung-Chian Hsu, Wei-Cyun Tsao, Arthur Chang, Chuan-Yu Chang
Published in: Intell. Data Anal. (2021)
Keyphrases
- data processing
- original data
- synthetic data
- data sources
- data sets
- database
- data structure
- prior knowledge
- raw data
- input data
- image data
- data analysis
- training data
- extracted features
- data quality
- multiple types
- data collection
- high quality
- attribute values
- categorical data
- special features
- structural information
- feature set
- co-occurrence
- classification accuracy
- end users
- decision trees