The "Collections as ML Data" Checklist for Machine Learning & Cultural Heritage.
Benjamin Charles Germain LeePublished in: CoRR (2022)
Keyphrases
- cultural heritage
- machine learning
- data analysis
- data sets
- data collection
- data processing
- heterogeneous collections
- data quality
- statistical methods
- digital libraries
- training data
- maximum likelihood
- data sources
- database
- digital collections
- complex data
- information retrieval
- raw data
- data mining techniques
- knowledge discovery
- machine learning algorithms
- missing data
- multimedia
- statistical analysis
- data structure
- data mining
- digital archives
- data points