Old spellings, new methods: automated procedures for indeterminate linguistic data.
Hugh Craig, R. Whipp
Published in: Lit. Linguistic Comput. (2010)
Keyphrases
- data sets
- data collection
- statistical methods
- high dimensional data
- data analysis
- data mining methods
- data processing
- original data
- human experts
- missing values
- image data
- incomplete data
- data mining techniques
- data sources
- statistical analysis
- raw data
- missing data
- preprocessing
- text classification
- data reduction
- high quality
- large scale data sets
- data representations
- database
- data quality
- data mining applications
- complex structures
- statistical tests
- noisy data
- data distribution
- spatial data
- benchmark datasets
- medical images
- natural language processing
- end users
- significant improvement
- data structure
- training data