Unifying convolution and transformer: a dual stage network equipped with cross-interactive multi-modal feature fusion and edge guidance for RGB-D salient object detection.
Shilpa Elsa AbrahamBinsu C. KovoorPublished in: J. Ambient Intell. Humaniz. Comput. (2024)