Enhancing Embodied Object Detection through Language-Image Pre-training and Implicit Object Memory.
Nicolas Harvey ChapmanFeras DayoubWill N. BrowneChris LehnertPublished in: CoRR (2024)
Keyphrases
- object detection
- object detectors
- multiple objects
- bounding box
- image data
- input image
- image analysis
- object localization
- scene understanding
- image content
- object hypotheses
- image features
- pixel level
- region of interest
- single image
- detecting objects
- category level
- multiscale
- object categories
- spatial relations
- lighting conditions
- normalized correlation
- image regions
- image retrieval
- image representation
- image classification
- visual context
- spatial relationships
- high resolution
- d objects
- target object
- face detection
- complex scenes
- computer vision
- similar objects
- object models
- edge detection
- segmentation method
- object model
- object segmentation
- keypoints
- image segments
- image matching
- object recognition
- background clutter
- three dimensional objects
- scene recognition
- object classes
- partial occlusion
- test images