Cross Modal Audio Search and Retrieval with Joint Embeddings Based on Text and Audio.
Benjamin ElizaldeShuayb ZararBhiksha RajPublished in: ICASSP (2019)
Keyphrases
- cross modal
- multi modal
- multimedia retrieval
- video search
- image retrieval
- visual data
- multimedia databases
- visual similarity
- text retrieval
- visual recognition
- multiple modalities
- content based retrieval
- keywords
- semantic concepts
- visual information
- visual features
- nearest neighbor
- feature extraction
- information retrieval