Hard and soft clustering of categorical time series based on two novel distances with an application to biological sequences.
Ángel López-Oriona, José Antonio Vilar, Pierpaolo D'Urso
Published in: Inf. Sci. (2023)
Keyphrases
- biological sequences
- soft clustering
- longest common subsequence
- sequence data
- molecular biology
- multi-source
- protein sequences
- computational biology
- distance measure
- biological data
- DNA sequences
- dynamic time warping
- adaptive resonance theory
- distance function
- binding sites
- machine learning
- fuzzy c-means
- Euclidean distance
- clustering quality
- neural network
- database
- self-organizing maps
- sequence databases
- data mining