: Speech-Scene Graph Grounding Network for Speech-guided Navigation.
Dohyun KimYeseung KimJaehwi JangMinjae SongWoojin ChoiDaehyung ParkPublished in: CoRR (2023)
Keyphrases
- speech recognition
- speech signal
- text to speech
- automatic speech recognition
- speech synthesis
- wireless sensor networks
- audio visual
- overlapping communities
- network model
- three dimensional
- d scene
- dynamic networks
- single image
- random walk
- broadcast news
- spoken language
- path length
- small world
- graphical representation
- bipartite graph
- weighted graph
- dynamic scenes
- directed graph
- network traffic
- network structure
- structured data
- input image
- object recognition
- video sequences