Is it Time to Swish? Comparing Deep Learning Activation Functions Across NLP tasks.
Steffen EgerPaul YoussefIryna GurevychPublished in: EMNLP (2018)
Keyphrases
- deep learning
- activation function
- neural network
- unsupervised learning
- natural language processing
- machine learning
- artificial neural networks
- feed forward
- neural nets
- hidden layer
- information extraction
- back propagation
- question answering
- mental models
- weakly supervised
- learning algorithm
- knn
- genetic algorithm
- recurrent neural networks
- data sets