: Speech-Scene Graph Grounding Network for Speech-guided Navigation.

Dohyun Kim Yeseung Kim Jaehwi Jang Minjae Song Woojin Choi Daehyung Park

Published in: CoRR (2023)

Keyphrases

speech recognition
speech signal
text to speech
automatic speech recognition
speech synthesis
wireless sensor networks
audio visual
overlapping communities
network model
three dimensional
d scene
dynamic networks
single image
random walk
broadcast news
spoken language
path length
small world
graphical representation
bipartite graph
weighted graph
dynamic scenes
directed graph
network traffic
network structure
structured data
input image
object recognition
video sequences