Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset.
Hugo Laurençon, Léo Tronchon, Victor Sanh
Published in: CoRR (2024)
Keyphrases
- web pages
- website
- web development
- web applications
- web documents
- source code
- source files
- web resources
- web mining
- benchmark datasets
- end users
- web data
- step-by-step instructions
- dynamic content
- synthetic datasets
- web technologies
- data mining
- web users
- linked data
- information extraction
- database
- information overload
- web content
- semi-structured
- data extraction
- open source
- database-driven
- social media