Stand-off Annotation of Web Content as a Legally Safer Alternative to Bitext Crawling for Distribution.
Mikel L. ForcadaMiquel Esplà-GomisJuan Antonio Pérez-OrtizPublished in: EAMT (2016)
Keyphrases
- web content
- web pages
- website
- user generated
- web data
- web documents
- web users
- semantic browsing
- web information
- search engine
- semantic annotation
- probability distribution
- social media
- automatic annotation
- metadata
- web browsing
- web mining
- web search engines
- user interests
- link analysis
- active learning
- rss feeds
- data mining