Information in Data: Using the Oxford English Dictionary on a Computer.
Michael LeskPublished in: SIGIR Forum (1986)
Keyphrases
- raw data
- computer systems
- data collection
- end users
- information sources
- collected data
- data sets
- huge amounts
- complex data
- web data
- structural information
- log data
- historical data
- data analysis
- prior knowledge
- image data
- digital data
- missing information
- database
- data quality
- domain knowledge
- data processing
- essential information
- training data
- heterogeneous sources
- data records
- heterogeneous data
- keywords
- information extraction
- temporal information
- statistical analysis
- sensor data
- information resources
- domain experts
- background knowledge
- external data
- data management
- stored data
- probability distribution
- background information
- sensitive data
- privacy concerns
- log files
- data sources
- xml documents
- original data