A Fast Text Similarity Measure for Large Document Collections using Multi-reference Cosine and Genetic Algorithm.
Hamid MohammadiSeyed Hossein KhastehPublished in: CoRR (2018)
Keyphrases
- genetic algorithm
- similarity measure
- euclidean distance
- cosine measure
- feature vectors
- multi objective
- reference set
- mutual information
- neural network
- genetic algorithm is applied
- cosine similarity measure
- free text
- text retrieval
- fitness function
- simulated annealing
- text mining
- pairwise
- keywords
- database
- distance measure
- metaheuristic
- genetic algorithm ga
- semantic similarity
- fuzzy logic
- textual data
- evolutionary algorithm