Login / Signup
Audio to Score Matching by Combining Phonetic and Duration Information.
Rong Gong
Jordi Pons
Xavier Serra
Published in:
ISMIR (2017)
Keyphrases
</>
contextual information
spatial information
information retrieval
social networks
similarity measure
keywords
prior knowledge
domain knowledge
end users
multi modal
higher level
pattern matching
speech recognition
structural information
video signals
audio stream