NICHE: A Curated Dataset of Engineered Machine Learning Projects in Python.
Ratnadira WidyasariZhou YangFerdian ThungSheng Qin SimFiona WeeCamellia LokJack PhanHaodi QiConstance TanQijin TayDavid LoPublished in: MSR (2023)
Keyphrases
- machine learning
- machine learning algorithms
- programming language
- feature selection
- pattern recognition
- learning algorithm
- decision trees
- information extraction
- open source
- natural language
- knowledge acquisition
- inductive learning
- software development
- statistical methods
- machine learning methods
- computer science
- case study
- supervised machine learning
- scientific databases
- machine learning approaches
- open source software
- computer vision
- benchmark datasets
- explanation based learning
- training dataset
- programming tool
- graphical user interface
- database
- semi supervised learning
- model selection
- text classification
- computational intelligence
- natural language processing
- knowledge representation
- active learning
- reinforcement learning
- artificial intelligence
- data mining
- neural network