Squeeze-Excitation Convolutional Recurrent Neural Networks for Audio-Visual Scene Classification.
Javier Naranjo-Alcazar, Sergi Perez-Castanos, Maximo Cobos, Francesc J. Ferri, Pedro Zuccarello
Published in: DCASE (2021)
Keyphrases
- audio visual
- recurrent neural networks
- scene classification
- object recognition
- multi modal
- natural scenes
- image classification
- biologically inspired
- visual information
- visual words
- feed forward
- neural network
- image representation
- multimedia
- visual data
- natural images
- artificial neural networks
- bag of features
- echo state networks
- higher order
- image features
- high dimensional data
- multiscale
- search engine