Simplified Data Wrangling with ir_datasets.
Sean MacAvaneyAndrew YatesSergey FeldmanDoug DowneyArman CohanNazli GoharianPublished in: CoRR (2021)
Keyphrases
- data sets
- data structure
- raw data
- synthetic data
- data analysis
- image data
- database
- data sources
- data processing
- computer systems
- data quality
- data collection
- input data
- probability distribution
- test data
- experimental conditions
- data repositories
- decision trees
- web search
- data mining algorithms
- statistical analysis
- prior knowledge
- knowledge discovery
- spatial data
- data distribution
- high quality
- network structure
- data objects
- information retrieval
- complex data
- statistical significance
- sampling methods