Login / Signup

Categorizing and Extracting Information from Multilingual HTML Documents.

Seung Jin LimYiu-Kai Ng
Published in: IDEAS (2005)
Keyphrases
  • web documents
  • html documents
  • domain knowledge
  • low level
  • information sources
  • keywords
  • prior knowledge
  • information extraction
  • contextual information
  • repeated patterns