Unifying convolution and transformer: a dual stage network equipped with cross-interactive multi-modal feature fusion and edge guidance for RGB-D salient object detection.

Published in: J. Ambient Intell. Humaniz. Comput. (2024)
