A survey on multimodal bidirectional machine learning translation of image and natural language processing.
Wongyung Nam, Beakcheol Jang
Published in: Expert Syst. Appl. (2024)
Keyphrases
- machine learning
- natural language processing
- image classification
- image data
- single image
- image features
- image representation
- text mining
- template matching
- input image
- multiscale
- image analysis
- image structure
- segmentation method
- high resolution
- information extraction
- computational linguistics
- image retrieval
- image segmentation
- image content
- data mining
- text processing
- computational biology
- machine translation
- low level
- hough transform
- similarity measure
- feature points
- edge detection
- multi modal
- region of interest
- decision trees
- image collections
- learning algorithm
- natural language
- multiresolution
- feature extraction
- scale space
- image set
- semantic relations
- image pixels