Chain-of-Look Prompting for Verb-centric Surgical Triplet Recognition in Endoscopic Videos.
Nan XiJingjing MengJunsong YuanPublished in: ACM Multimedia (2023)
Keyphrases
- minimally invasive
- human activities
- recognition rate
- endoscopic video
- minimally invasive surgery
- object recognition
- recognition accuracy
- recognition algorithm
- pattern recognition
- video analysis
- surgical instruments
- feature extraction
- real time
- recognition process
- video sequences
- robot assisted
- user centric
- computer vision
- video data
- natural language
- intraoperative
- gesture recognition
- hand gestures
- character recognition
- video clips
- human actions
- static images
- video content
- video frames
- activity recognition