ACTUAL: Audio Captioning With Caption Feature Space Regularization.
Yiming ZhangHong YuRuoyi DuZheng-Hua TanWenwu WangZhanyu MaYuan DongPublished in: IEEE ACM Trans. Audio Speech Lang. Process. (2023)
Keyphrases
- feature space
- visual features
- high dimensional
- visual information
- feature vectors
- multimedia
- image retrieval
- classification accuracy
- high dimensionality
- regularization parameter
- data points
- input space
- image representation
- support vector machine
- mean shift
- visual data
- signal processing
- dimensionality reduction
- hyperplane
- audio visual
- mercer kernels
- audio signals
- text extraction
- kernel function
- training samples
- multi modal
- feature selection
- kernel methods
- image restoration
- input data
- principal component analysis
- feature extraction
- video shots
- image processing
- polynomial kernels
- machine learning