A Data-Driven Methodology for Guiding the Selection of Preprocessing Techniques in a Machine Learning Pipeline.
Jorge García-CarrascoAlejandro MatéJuan TrujilloPublished in: CAiSE Forum (2023)
Keyphrases
- data driven
- machine learning
- preprocessing
- pattern recognition
- machine learning algorithms
- feature selection
- machine learning approaches
- post processing
- knowledge acquisition
- learning tasks
- model driven
- learning algorithm
- computer science
- selection algorithm
- knowledge representation
- inductive logic programming
- statistical methods
- text classification
- statistical machine learning
- data mining
- selection strategy
- preprocessing steps
- knowledge engineering
- databases
- text mining
- feature extraction
- neural network
- model selection
- information retrieval
- preprocessing phase
- selection criteria
- machine learning and data mining
- e learning
- design methodology
- computational biology
- preprocessing step
- active learning
- support vector machine
- information extraction
- natural language processing