Preparing Text Reports from Web Pages Employing Similarity Tests.
J. Guadalupe RamosJuan C. SolorioLourdes CampoySebastian RuizNicolas JassoPublished in: ENC (2013)
Keyphrases
- web pages
- keywords
- web documents
- textual content
- website
- plain text
- content features
- text documents
- text content
- sentence similarity
- similarity measure
- search engine
- web search
- anchor text
- semantic labels
- html pages
- web page classification
- text data
- text information
- semantic similarity
- database
- edit distance
- similarity function
- text representation
- distance measure
- text mining
- web communities
- technical papers
- information retrieval
- word pairs
- web images
- free text
- web search engines
- data records
- document similarity
- data extraction
- textual features
- reference set
- web browser
- hierarchical structure
- distance function
- web content mining