The NTT DCASE2020 Challenge Task 6 system: Automated Audio Captioning with Keywords and Sentence Length Estimation.
Yuma KoizumiDaiki TakeuchiYasunori OhishiNoboru HaradaKunio KashinoPublished in: CoRR (2020)
Keyphrases
- keywords
- multimedia
- natural language
- noun phrases
- semi automated
- signal processing
- text representation
- audio signals
- search engine
- audio stream
- automated tools
- semantic context
- word frequency
- keyword extraction
- semantic content
- audio visual
- keyword search
- parameter estimation
- visual features
- low level
- hidden markov models
- image retrieval