Sign in

Speech activity detection and face orientation estimation using multiple microphone arrays and human position information.

Carlos Toshinori IshiJani EvenNorihiro Hagita
Published in: IROS (2015)
Keyphrases
  • position information
  • orientation estimation
  • speaker diarization
  • virtual humans
  • sound source
  • image processing
  • virtual environment
  • visual information
  • facial images
  • rotation invariant
  • smart room