DOM-based print-link detection for web article extraction.
Sam Liu, Suk Hwan Lim, Jerry Liu
Published in: Imaging and Printing in a Web 2.0 World (2011)
Keyphrases
- website
- web pages
- web information extraction
- data extraction
- web applications
- linked data
- information sources
- web documents
- web content
- web information retrieval
- semantic web
- web resources
- user experience
- web information
- user generated content
- web users
- data sets
- relational databases
- automatic extraction
- information extraction
- social networks
- neural network