Vector Space Representations of Documents in Classifying Finnish Social Media Texts.
Viljami Venekoski, Samir Puuska, Jouko Vankka
Published in: ICIST (2016)
Keyphrases
- vector space
- social media
- vector space model
- document representation
- latent semantic indexing
- cosine similarity
- retrieval model
- text documents
- euclidean space
- feature vectors
- distance measure
- training documents
- low dimensional
- social networks
- concept space
- similarity search
- keywords
- information retrieval
- machine learning
- high dimensional
- tf-idf
- similarity measure
- database
- image classification
- principal component analysis
- riemannian manifolds
- face recognition
- index terms