Combining Swin Transformer and Attention-Weighted Fusion for Scene Text Detection.
Xianguo LiXingchen YaoYi LiuPublished in: Neural Process. Lett. (2024)
Keyphrases
- text detection
- urban scenes
- outdoor scenes
- natural scene images
- d scene
- scene text
- scene understanding
- object recognition
- single image
- urban environments
- video sequences
- text information
- three dimensional
- multiple images
- vanishing points
- point cloud
- dynamic scenes
- video analysis
- complex scenes
- visual attention
- connected components
- structure from motion
- moving objects
- aerial images
- ground plane
- multi band
- visual information
- detection method