Login / Signup

Multimodal embedding fusion for robust speaker role recognition in video broadcast.

Mickael RouvierSebastien DelecrazBenoît FavreMeriem BendrisFrédéric Béchet
Published in: ASRU (2015)
Keyphrases