A clustering framework for lexical normalization of Roman Urdu.
Abdul Rafae KhanAsim KarimHassan SajjadFaisal KamiranJia XuPublished in: Nat. Lang. Eng. (2022)
Keyphrases
- clustering framework
- clustering method
- clustering algorithm
- similarity metric
- k means
- wordnet
- text clustering
- high dimensional datasets
- natural language processing
- semi supervised
- language identification
- sentiment analysis
- semantic relations
- self organizing maps
- similarity measure
- data clustering
- high dimensional
- text documents
- keywords
- data sets