SEVEN: Pruning Transformer Model by Reserving Sentinels.
Jinying Xiao
Ping Li
Jie Nie
Zhe Tang
Published in:
CoRR (2024)
Keyphrases
probability distribution
data mining
reinforcement learning
data structure
probabilistic model
object model
artificial intelligence
information systems
search space
prior knowledge
least squares
experimental data
formal model