DataCockpit: A Toolkit for Data Lake Navigation and Monitoring Utilizing Quality and Usage Information.
Arpit NarechaniaSurya ChakrabortyShivam AgarwalAtanu R. SinhaRyan A. RossiFan DuJane HoffswellShunan GuoEunyee KohAlex EndertShamkant B. NavathePublished in: BigData (2023)
Keyphrases
- raw data
- data sets
- collected data
- high quality
- data collection
- complex data
- information sources
- sensor data
- information space
- historical data
- data processing
- database
- essential information
- missing information
- sensitive information
- heterogeneous data
- information resources
- web data
- data quality
- huge amounts
- prior knowledge
- data acquisition
- data analysis
- data structure
- computer systems
- end users
- structural information
- digital data
- input data
- temporal information
- data sources
- log data
- external data
- log files
- information services
- prior information
- original data
- data mining techniques
- training data
- real time