A Corpus of Memes from Reddit: Acquisition, Preparation and First Case Studies.

Thomas Schmidt Fabian Schiller Matthias Götz Christian Wolff

Published in: GI-Jahrestagung (2023)

Keyphrases

case study
lessons learned
open source
manually annotated
test set
real world
coreference resolution
acquisition process
literature review
data collection
high speed
text corpus
supervised machine learning
real time
open domain
newspaper articles
text corpora
natural language text
text data
development process
data acquisition
knowledge management
text mining
feature selection
genetic algorithm
data sets