Robust methods in analysis of natural language data.
Afzal BallimVincenzo PallottaPublished in: Nat. Lang. Eng. (2002)
Keyphrases
- data analysis
- high dimensional data
- statistical methods
- data mining techniques
- data sets
- natural language
- data analysis tasks
- database
- missing values
- statistical analysis
- noisy data
- descriptive statistics
- input data
- data processing
- statistical tests
- data representations
- machine learning
- training data
- experimental data
- contingency tables
- raw data
- significant improvement
- synthetic data
- high quality
- computer systems
- data collection
- multiple sources
- empirical data
- data acquisition
- data mining methods
- databases
- original data
- human experts
- data mining algorithms
- missing data
- benchmark datasets
- data structure
- preprocessing
- information extraction