The Influence of Data Homogeneity on NLP System Performance.
Etienne Denoual
Published in: IJCNLP (companion) (2005)
Keyphrases
- data sets
- data distribution
- data collection
- high quality
- data points
- database systems
- data structure
- data quality
- database
- input data
- training data
- data sources
- machine learning
- experimental data
- synthetic data
- data processing
- natural language processing
- knowledge representation
- probability distribution
- data analysis
- computer systems
- statistical analysis
- question answering
- data acquisition
- raw data
- original data
- data objects
- databases