ViTCA-Net: a framework for disease detection in video capsule endoscopy images using a vision transformer and convolutional neural network with a specific attention mechanism.
Yassine OukdachZakaria KerkaouMohamed El AnsariLahcen KouttiAhmed Fouad El OuafdiThomas De LangePublished in: Multim. Tools Appl. (2024)
Keyphrases
- capsule endoscopy
- small bowel
- computer aided
- automatic detection
- tumor detection
- narrow band
- real time
- image data
- computer vision
- convolutional neural network
- multimedia
- object recognition
- image features
- input image
- object detection
- attention mechanism
- machine learning
- neural network
- feature extraction
- level set method
- image regions
- face detection
- active contours
- pairwise