Visual and language semantic hybrid enhancement and complementary for video description.

Published in: Neural Comput. Appl. (2022)

Keyphrases