An information fusion framework with multi-channel feature concatenation and multi-perspective system combination for the deep-learning-based robust recognition of microphone array speech.
Yanhui TuJun DuQing WangXiao BaoLi-Rong DaiChin-Hui LeePublished in: Comput. Speech Lang. (2017)