A Natural Language Processing Pipeline for Detecting Informal Data References in Academic Literature.
Sara Lafia, Lizhou Fan, Libby Hemphill
Published in: CoRR (2022)
Keyphrases
- data sets
- statistical analysis
- processing pipeline
- database
- data collection
- natural language
- complex data
- data sources
- raw data
- application domains
- high quality
- data structure
- data analysis
- databases
- historical data
- original data
- training data
- end users
- prior knowledge
- expert systems
- data processing
- neural network
- high dimensional data
- sensor data
- spatial data
- natural language processing
- data distribution
- data acquisition
- missing values
- data points
- data streams