Combining URL and HTML Features for Entity Discovery in the Web.
Edimar Manica, Carina Friedrich Dorneles, Renata Galante
Published in: ACM Trans. Web (2019)
Keyphrases
- web pages
- website
- web documents
- content features
- textual features
- semi-structured
- specific features
- feature set
- web content
- dynamic content
- feature vectors
- web applications
- web technologies
- web mining
- web databases
- web crawler
- web browser
- web users
- pattern discovery
- data mining
- named entities
- co-occurrence
- image features
- feature selection
- search engine