LLMs with User-defined Prompts as Generic Data Operators for Reliable Data Processing.
Luyi MaNikhil ThakurdesaiJiao ChenJianpeng XuEvren KörpeogluSushant KumarKannan AchanPublished in: BigData (2023)
Keyphrases
- user defined
- data processing
- data analysis
- data sets
- data collection
- data structure
- data types
- data points
- high quality
- data acquisition
- raw data
- image data
- synthetic data
- training data
- query language
- data sources
- computer systems
- missing data
- original data
- input data
- domain specific
- management system
- knowledge discovery
- probability distribution
- end users
- metadata
- data mining
- databases