The Engage Corpus: A Social Media Dataset for Text-Based Recommender Systems.
Daniel ChengKyle YanPhillip KeungNoah A. SmithPublished in: LREC (2022)
Keyphrases
- recommender systems
- social media
- netflix prize
- collaborative filtering
- user generated content
- social media data
- social networks
- information filtering
- user preferences
- benchmark datasets
- million images
- textual features
- user profiling
- user modeling
- image search
- training dataset
- manually annotated
- social networking
- cold start problem
- big data
- multimedia
- test set
- user profiles
- language model
- matrix factorization
- data sparsity
- trust aware
- database