Entropy based informative content density approach for efficient web content extraction.
Manjusha Annam, G. P. Sajeev
Published in: ICACCI (2016)
Keyphrases
- content extraction
- web news
- text content
- web content
- digital archives
- news pages
- user generated content
- web documents
- website
- user generated
- web mining
- metadata
- domain knowledge
- user interests
- database
- databases
- web pages
- multimedia
- multimedia information retrieval
- semantic content
- web data
- semantic information
- social media