Lexical quality as a proxy for web text understandability.
Luz RelloRicardo Baeza-YatesPublished in: WWW (Companion Volume) (2012)
Keyphrases
- web documents
- text information
- web applications
- website
- textual data
- software quality
- information retrieval and extraction
- lexical features
- information retrieval
- textual features
- linguistic analysis
- keywords
- text retrieval
- high quality
- natural language text
- end users
- web mining
- web pages
- chinese text
- syntactic features
- web users
- database
- semantic web
- natural language processing
- web images
- textual information
- word sense
- context sensitive
- text corpus
- syntactic information
- information sources
- text mining
- recognizing textual entailment
- information extraction
- metadata
- textual entailment recognition