Audio-visual speech recognition based on regulated transformer and spatio-temporal fusion strategy for driver assistive systems.
Dmitry RyuminAlexandr AxyonovElena RyuminaDenis IvankoAlexey M. KashevnikAlexey KarpovPublished in: Expert Syst. Appl. (2024)