Discovering Sublanguages in a Large Clinical Corpus through Unsupervised Machine Learning and Information Gain.
Terri Elizabeth WorkmanGuy DivitaQing Zeng-TreitlerPublished in: IEEE BigData (2019)
Keyphrases
- information gain
- machine learning
- decision trees
- feature selection
- text categorization
- chi square
- unsupervised learning
- mutual information
- chi squared
- supervised learning
- naive bayes
- learning algorithm
- text classification
- text mining
- machine learning algorithms
- occurrence frequency
- pattern recognition
- knowledge discovery
- data analysis
- natural language processing
- data mining
- semi supervised
- information extraction
- genetic programming
- model selection
- image registration
- semi supervised learning
- support vector
- correlation coefficient
- image processing
- computer vision
- support vector machine
- active learning