I see what you hear: a vision-inspired method to localize words.
Mohammad Samragh, Arnav Kundu, Ting-Yao Hu, Minsik Cho, Aman Chadha, Ashish Shrivastava, Oncel Tuzel, Devang Naik
Published in: CoRR (2022)
Keyphrases
- experimental evaluation
- high accuracy
- clustering method
- significant improvement
- computational cost
- cost function
- high precision
- segmentation method
- synthetic data
- detection method
- optimization algorithm
- prior knowledge
- image processing
- computer vision
- information retrieval
- dynamic programming
- preprocessing
- computational complexity
- mutual information
- computationally efficient
- text categorization
- objective function
- machine learning
- classification method
- real time