Pre-trained models, data augmentation, and ensemble learning for biomedical information extraction and document classification.
Arslan Erdengasileng, Qing Han, Tingting Zhao, Shubo Tian, Xin Sui, Keqiao Li, Wanjing Wang, Jian Wang, Ting Hu, Feng Pan, Yuan Zhang, Jinfeng Zhang
Published in: Database J. Biol. Databases Curation (2022)
Keyphrases
- document classification
- information extraction
- ensemble learning
- data sets
- text mining
- training data
- prior knowledge
- data analysis
- data distribution
- pre-trained
- classification algorithm
- prediction accuracy
- high dimensional data
- web documents
- data points
- text classification
- knowledge discovery
- probabilistic model
- high dimensional
- test data
- text documents
- knowledge base
- information retrieval
- machine learning