Structured Querying of Web Text Data: A Technical Challenge.
Michael J. CafarellaChristopher RéDan SuciuOren EtzioniPublished in: CIDR (2007)
Keyphrases
- text data
- technical challenges
- structured data
- unstructured text
- web pages
- textual data
- text mining
- text classification
- high dimensional
- text documents
- high dimensional data
- topic hierarchies
- document collections
- databases
- real world
- database
- neural network
- information extraction
- web documents
- information retrieval systems
- knowledge discovery
- digital libraries
- metadata
- data sets
- text analytics
- nmr spectra