Detecting similar HTML documents using a fuzzy set information retrieval approach.
Rajiv YerraYiu-Kai NgPublished in: GrC (2005)
Keyphrases
- fuzzy sets
- html documents
- information retrieval
- membership functions
- fuzzy set theory
- rough sets
- fuzzy logic
- structured documents
- web documents
- interval valued
- computational intelligence
- web page retrieval
- fuzzy relations
- fuzzy rules
- search engine
- web pages
- automatic extraction
- information extraction
- semantic information
- relevant documents
- vector space model
- pattern recognition
- language model
- fuzzy rough sets
- knowledge discovery
- regular expressions
- computer vision
- high level
- semistructured data
- low level
- information retrieval systems
- semi structured
- query expansion
- object oriented