• search
    search
  • reviewers
    reviewers
  • feeds
    feeds
  • assignments
    assignments
  • settings
  • logout

BYOL for Audio: Exploring Pre-Trained General-Purpose Audio Representations.

Daisuke NiizumiDaiki TakeuchiYasunori OhishiNoboru HaradaKunio Kashino
Published in: IEEE ACM Trans. Audio Speech Lang. Process. (2023)
Keyphrases
  • general purpose
  • multimedia
  • pre trained
  • visual information
  • audio visual
  • visual data
  • signal processing
  • face recognition
  • wide range
  • viewpoint
  • feature vectors
  • probabilistic model