A novel method for image captioning using multimodal feature fusion employing mask RNN and LSTM models.
Kumaravel ThangavelNatesan PalanisamySuresh MuthusamyOm Prava MishraSuma Christal Mary SundararajanHitesh PanchalAshok Kumar LoganathanPonarun RamamoorthiPublished in: Soft Comput. (2023)