Login / Signup
IPA-CLIP: Integrating Phonetic Priors into Vision and Language Pretraining.
Chihaya Matsuhira
Marc A. Kastner
Takahiro Komamizu
Takatsugu Hirayama
Keisuke Doman
Yasutomo Kawanishi
Ichiro Ide
Published in:
CoRR (2023)
Keyphrases
</>
programming language
language learning
real time
database
computer vision
image processing
learned from training data
language processing
vision system
speech recognition
prior knowledge
natural language
specification language
spoken language
object oriented programming
computational linguistics
visual field
neural network