Massively scalable near duplicate detection in streams of documents using MDSH.

Published in: IEEE BigData (2013)

Keyphrases