A comparison of techniques for estimating IDF values to generate lexical signatures for the web.
Martin KleinMichael L. NelsonPublished in: WIDM (2008)
Keyphrases
- website
- context sensitive
- web applications
- information sources
- web documents
- semantic web
- web content
- signature recognition
- user generated content
- tf idf
- web resources
- web users
- attribute values
- domain specific
- wordnet
- web data
- natural language processing
- information extraction
- digital libraries
- social networks
- term weighting
- information retrieval