Prediction of actions and places by the time series recognition from images with Multimodal LLM.
Tomohiro OgawaKango YoshiokaKen FukudaTakeshi MoritaPublished in: ICSC (2024)
Keyphrases
- object recognition
- image recognition
- image data
- image database
- image classification
- three dimensional
- image registration
- prediction model
- image analysis
- input image
- image matching
- ground truth
- recognition rate
- image set
- visual object recognition
- multi modal
- prediction accuracy
- computer vision
- traffic signs
- image collections
- image retrieval
- character recognition
- segmentation method
- financial time series
- handwritten digits
- illumination invariant face recognition
- plan recognition
- recognition accuracy
- test images
- activity recognition
- edge detection
- image processing