Login / Signup
Audio-visual scene understanding utilizing text information for a cooking support robot.
Ryosuke Kojima
Osamu Sugiyama
Kazuhiro Nakadai
Published in:
IROS (2015)
Keyphrases
</>
audio visual
scene understanding
text information
vision system
multi modal
object recognition
object detection
d scene
visual information
video surveillance
multimedia
three dimensional
machine learning
nearest neighbor