Quantifying the Vanishing Gradient and Long Distance Dependency Problem in Recursive Neural Networks and Recursive LSTMs.
Phong Le, Willem H. Zuidema
Published in: Rep4NLP@ACL (2016)
Keyphrases
- long distance
- recursive neural networks
- neural network
- machine learning
- mutual exclusion
- gesture recognition
- edge detection
- computer technology
- information systems
- gradient information
- gradient direction
- upper layer
- image processing
- statistically significant
- zero crossing
- datalog programs
- gradient method
- recursive algorithm
- recursive queries
- learning algorithm