TF-IDF vs Word Embeddings for Morbidity Identification in Clinical Notes: An Initial Study.
Danilo DessìRim HelaouiVivek KumarDiego Reforgiato RecuperoDaniele RiboniPublished in: CoRR (2021)
Keyphrases
- tf idf
- term frequency
- term weighting
- stop words
- weighting scheme
- vector space model
- inverse document frequency
- information retrieval
- text categorization
- vector space
- retrieval model
- text documents
- document clustering
- document frequency
- co occurrence
- average precision
- ranking algorithm
- n gram
- semantic similarity
- retrieval systems
- keywords