Language Query-Based Transformer With Multiscale Cross-Modal Alignment for Visual Grounding on Remote Sensing Images.
Meng LanFu RongHongzan JiaoZhi GaoLefei ZhangPublished in: IEEE Trans. Geosci. Remote. Sens. (2024)
Keyphrases
- cross modal
- remote sensing images
- multiscale
- multimedia retrieval
- multi modal
- remote sensing
- change detection
- multispectral
- visual data
- perceptual information
- query processing
- image retrieval
- multimedia databases
- visual recognition
- image fusion
- multimedia information retrieval
- image processing
- query expansion
- text retrieval
- visual similarity
- multimedia data
- satellite images
- range queries
- image data
- data sources
- image segmentation
- hyperspectral
- keywords
- multimedia