NICHE: A Curated Dataset of Engineered Machine Learning Projects in Python.
Ratnadira WidyasariZhou YangFerdian ThungSheng Qin SimFiona WeeCamellia LokJack PhanHaodi QiConstance TanQijin TayDavid LoPublished in: CoRR (2023)
Keyphrases
- machine learning
- pattern recognition
- open source
- benchmark datasets
- programming language
- feature selection
- data analysis
- knowledge acquisition
- machine learning algorithms
- case study
- explanation based learning
- data sets
- learning systems
- inductive logic programming
- inductive learning
- text mining
- machine learning approaches
- supervised learning
- knowledge discovery
- reinforcement learning
- decision trees
- learning algorithm
- open source software
- scientific databases
- development process
- machine learning methods
- semi supervised learning
- text classification
- software development
- natural language processing
- computer science
- data mining