Keyphrases
- web documents
- information extraction
- web pages
- document classification
- semi structured
- fuzzy sets
- clustering algorithm
- web search engines
- textual information
- vector space model
- keywords
- document representation
- relational data
- web content
- semi supervised
- databases
- text mining
- web data
- focused crawling
- collaborative filtering
- natural language processing
- website
- xml documents
- social annotations