Genre-Oriented Web Content Extraction with Deep Convolutional Neural Networks and Statistical Methods.
Bao-Dai Nguyen-HoangBao-Tran Pham-HongYiping JinPhu T. V. LePublished in: PACLIC (2018)
Keyphrases
- statistical methods
- content extraction
- convolutional neural networks
- web news
- statistical analysis
- web documents
- text content
- digital archives
- web pages
- machine learning methods
- machine learning
- website
- statistical models
- statistical approaches
- html documents
- semantic web
- web content
- web mining
- user generated content
- news pages
- machine learning algorithms
- semi structured data
- knowledge discovery
- database systems
- information retrieval
- database