Login / Signup

Extracting the Main Content of Web Documents Based on Character Encoding and a Naive Smoothing Method.

Hadi MohammadzadehThomas GottronFranz SchweiggertGholamreza Nakhaeizadeh
Published in: ICSOFT (Selected Papers) (2011)
Keyphrases
  • web documents
  • clustering method
  • similarity measure
  • web pages
  • databases
  • web mining
  • web data