A Corpus of Memes from Reddit: Acquisition, Preparation and First Case Studies.
Thomas Schmidt, Fabian Schiller, Matthias Götz, Christian Wolff
Published in: GI-Jahrestagung (2023)
Keyphrases
- case study
- lessons learned
- open source
- manually annotated
- test set
- real world
- coreference resolution
- acquisition process
- literature review
- data collection
- high speed
- text corpus
- supervised machine learning
- real time
- open domain
- newspaper articles
- text corpora
- natural language text
- text data
- development process
- data acquisition
- knowledge management
- text mining
- feature selection
- genetic algorithm
- data sets