More Robots are Coming: Large Multimodal Models (ChatGPT) can Solve Visually Diverse Images of Parsons Problems.
Irene HouOwen ManSophia MettilleSebastian GutierrezKenneth AngelikasStephen MacNeilPublished in: ACE (2024)
Keyphrases
- multimodal image registration
- image registration
- solving complex
- optimization problems
- cooperative
- parameter estimation
- mathematical programming
- ground truth
- d objects
- image data
- input image
- image database
- edge detection
- mobile robot
- keypoints
- path planning
- region of interest
- multiple images
- image analysis
- three dimensional