Streaming Speaker-Attributed ASR with Token-Level Speaker Embeddings.

Published in: INTERSPEECH (2022)

Keyphrases