UniTR: A Unified TRansformer-Based Framework for Co-Object and Multi-Modal Saliency Detection.

Ruohao Guo, Xianghua Ying, Yanyu Qi, Liao Qu
Published in: IEEE Trans. Multim. (2024)
Keyphrases
  • multi modal
  • saliency detection
  • audio visual
  • multiple modalities
  • object level
  • visual features
  • video content
  • semantic concepts
  • object detectors
  • d objects
  • object detection
  • natural images
  • image content