Data Readiness for Natural Language Processing.
Fredrik OlssonMagnus SahlgrenPublished in: CoRR (2020)
Keyphrases
- natural language processing
- data sets
- complex data
- data points
- original data
- training data
- data analysis
- data sources
- data mining techniques
- sensor data
- high dimensional data
- statistical analysis
- raw data
- data collection
- data processing
- probability distribution
- prior knowledge
- natural language
- high quality
- machine learning
- historical data
- statistical methods
- computer systems
- input data
- social media
- databases