mwetoolkit-lib: Adaptation of the mwetoolkit as a Python Library and an Application to MWE-based Document Clustering.
Fernando Rezende Zagatti, Paulo Augusto de Lima Medeiros, Esther da Cunha Soares, Lucas Nildaimon dos Santos Silva, Carlos Ramisch, Livy Real
Published in: MWE@LREC2022 (2022)
Keyphrases
- document clustering
- multiword
- text clustering
- document representation
- text mining
- clustering method
- vector space model
- document collections
- clustering algorithm
- text documents
- document clusters
- tf idf
- context sensitive
- clustering quality
- document retrieval
- biomedical literature
- information retrieval
- k nearest neighbor
- information extraction
- knn
- k means
- feature space
- metadata
- databases