A Natural Language Processing Pipeline for Detecting Informal Data References in Academic Literature.
Sara Lafia, Lizhou Fan, Libby Hemphill
Published in: ASIST (2022)
Keyphrases
- data sets
- original data
- raw data
- data processing
- data analysis
- image data
- training data
- small number
- processing pipeline
- sensor data
- attribute values
- statistical analysis
- data mining techniques
- neural network
- natural language
- data mining
- high quality
- knowledge discovery
- high dimensional data
- complex data
- data quality
- machine learning
- spatial data
- learning algorithm
- social networks
- information systems
- input data
- text mining
- high dimensional
- probability distribution