Login / Signup
Attention Weights in Transformer NMT Fail Aligning Words Between Sequences but Largely Explain Model Predictions.
Javier Ferrando
Marta R. Costa-jussà
Published in:
CoRR (2021)
Keyphrases
</>
high level
computational model
cost function
experimental data
prior knowledge
decision making
objective function
weighting scheme
machine learning
mathematical model
statistical model
formal model
parameter estimation
high voltage
classification models
em algorithm
fuzzy logic
social networks