Proposal and study of statistical features for string similarity computation and classification.
Érick Oliveira RodriguesDalcimar CasanovaMarcelo TeixeiraVinicius PegoriniFábio FavarimEsteban Walter Gonzalez CluaAura ConciPanos LiatsisPublished in: Int. J. Data Min. Model. Manag. (2020)
Keyphrases
- similarity computation
- feature vectors
- feature extraction
- feature set
- feature space
- feature selection
- support vector machine
- image classification
- data structure
- co occurrence
- text classification
- similarity metrics
- pattern recognition
- training set
- similarity measure
- distance measure
- web documents
- feature construction
- cosine similarity
- data sets