First is Better Than Last for Language Data Influence.
Chih-Kuan YehAnkur TalyMukund SundararajanFrederick LiuPradeep RavikumarPublished in: NeurIPS (2022)
Keyphrases
- data collection
- data sets
- database
- training data
- data processing
- data sources
- data analysis
- knowledge discovery
- data quality
- application domains
- data structure
- high quality
- image data
- search engine
- experimental data
- natural language
- prior knowledge
- feature selection
- statistical analysis
- information retrieval
- data distribution
- temporal information
- statistical methods
- data objects
- data mining
- complex data