TorchAudio: Building Blocks for Audio and Speech Processing.
Yao-Yuan Yang, Moto Hira, Zhaoheng Ni, Artyom Astafurov, Caroline Chen, Christian Puhrsch, David Pollack, Dmitriy Genzel, Donny Greenberg, Edward Z. Yang, Jason Lian, Jeff Hwang, Ji Chen, Peter Goldsborough, Sean Narenthiran, Shinji Watanabe, Soumith Chintala, Vincent Quenneville-Bélair
Published in: ICASSP (2022)
Keyphrases
- building blocks
- speech processing
- signal processing
- speaker identification
- speech recognition
- natural language processing
- multimedia systems
- artificial intelligence
- machine learning
- english text
- variable length
- multimedia
- image processing
- speech signal
- gaussian mixture model
- database systems
- multi modal
- broadcast news
- feature extraction
- database