Sign in

TAMformer: Multi-Modal Transformer with Learned Attention Mask for Early Intent Prediction.

Nada OsmanGuglielmo CamporeseLamberto Ballan
Published in: ICASSP (2023)
Keyphrases
  • multi modal
  • image annotation
  • multi modality
  • high dimensional
  • cross modal
  • fuzzy logic
  • fault diagnosis
  • audio visual
  • semantic concepts
  • computer vision
  • face recognition
  • image classification