Extracting news text from web pages: an application for the visually impaired.
Erik LundgrenPanagiotis PapapetrouLars AskerPublished in: PETRA (2015)
Keyphrases
- keywords
- web pages
- web documents
- textual content
- plain text
- data extraction
- text content
- news articles
- website
- search engine
- news stories
- automatically extracting
- news video
- text documents
- automatically extracted
- information retrieval
- content features
- visually impaired users
- scientific papers
- html pages
- text retrieval
- cross media
- news topics
- web content
- news sources
- short texts
- text extraction
- financial news
- web server
- database
- topic detection
- web users
- anchor text
- free text
- text data
- news web sites
- web search engines
- user comments
- wordnet
- user generated content
- web page classification
- textual data