pyannote.audio: neural building blocks for speaker diarization.
Hervé Bredin, Ruiqing Yin, Juan Manuel Coria, Gregory Gelly, Pavel Korshunov, Marvin Lavechin, Diego Fustes, Hadrien Titeux, Wassim Bouaziz, Marie-Philippe Gill
Published in: CoRR (2019)
Keyphrases
- building blocks
- speaker diarization
- audio stream
- broadcast news
- speaker identification
- speech recognition
- neural network
- multimedia
- hidden Markov models
- pattern recognition
- language model
- error rate
- speech signal
- automatic speech recognition
- video search
- video sequences
- speaker verification
- Bayesian information criterion
- information retrieval
- machine learning