Sign in

Identifying duplicate content using statistically improbable phrases.

Mounir ErramiZhaohui SunAngela C. GeorgeTara C. LongMichael A. SkinnerJonathan D. WrenHarold R. Garner
Published in: Bioinform. (2010)
Keyphrases
  • digital content
  • database
  • metadata
  • multimedia content
  • user generated
  • data sets
  • databases
  • multimedia
  • web content
  • neural network
  • machine learning
  • information systems
  • web documents
  • multimedia data