An Automatic Method to Extract Data from an Electronic Contract Composed of a Number of Documents in PDF Format.
Thomas KwokThao NguyenPublished in: CEC/EEE (2006)
Keyphrases
- synthetic data
- data collection
- data sets
- noisy data
- input data
- small number
- preprocessing
- prior knowledge
- test data
- missing data
- computational complexity
- database
- statistical methods
- data analysis
- objective function
- prior information
- fully automatic
- feature selection
- information loss
- knowledge discovery
- data points
- high dimensional data
- cluster centers
- random samples
- decision trees
- support vector machine
- probability distribution
- k means