Towards Identifying and Reducing the Bias of Disease Information Extracted from Search Engine Data.
Da-Cang Huang, Jinfeng Wang, Ji-Xia Huang, Daniel Z. Sui, Hongyan Zhang, Mao-Gui Hu, Cheng-Dong Xu
Published in: PLoS Comput. Biol. (2016)
Keyphrases
- raw data
- data sets
- collected data
- search engine
- complex data
- computer systems
- web data
- data processing
- information sources
- huge amounts
- end users
- domain experts
- multiple sources
- pre-processed
- sensor data
- sensitive information
- database
- log files
- digital data
- information resources
- essential information
- multimedia data
- keywords
- prior knowledge
- data points
- knowledge discovery
- data collection
- historical data
- log data
- information loss
- information access
- query processing
- information space
- missing information
- background knowledge
- XML documents
- data structure
- temporal information