Learning to Maximize Mutual Information for Chain-of-Thought Distillation.

Xin Chen, Hanxian Huang, Yanjun Gao, Yi Wang, Jishen Zhao, Ke Ding

Published in: CoRR (2024)

Keyphrases

mutual information
learning algorithm
pattern recognition
prior knowledge
elementary school
learning scheme
learning problems
learning tasks
information theoretic
learning systems
online learning
data mining
neural network
text classification
supervised learning
empirical studies
learning process
background knowledge
machine learning
real time
database