Resolving the Imbalance Issue in Hierarchical Disciplinary Topic Inference via LLM-based Data Augmentation.
Xunxin CaiMeng XiaoZhiyuan NingYuanchun ZhouPublished in: ICDM (Workshops) (2023)
Keyphrases
- data sets
- database
- high dimensional data
- data collection
- data sources
- data processing
- raw data
- data structure
- xml documents
- data analysis
- original data
- end users
- image data
- input data
- data quality
- sensor data
- experimental data
- complex data
- relational databases
- neural network
- data distribution
- feature selection
- labeled data
- statistical analysis
- computer systems
- bayesian networks