Sign in

DEJA-VU: Double Feature Presentation and Iterated Loss in Deep Transformer Networks.

Andros TjandraChunxi LiuFrank ZhangXiaohui ZhangYongqiang WangGabriel SynnaeveSatoshi NakamuraGeoffrey Zweig
Published in: ICASSP (2020)
Keyphrases