The WeSearch Corpus, Treebank, and Treecache - A Comprehensive Sample of User-Generated Content.
Jonathon Read, Dan Flickinger, Rebecca Dridan, Stephan Oepen, Lilja Øvrelid
Published in: LREC (2012)
Keyphrases
- user generated content
- social media
- penn treebank
- wikipedia articles
- user generated
- user comments
- wide coverage
- dependency parsing
- online social
- user contributed
- online social media
- relevant content
- recommender systems
- online forums
- tree bank
- pos tagging
- social media content
- hand crafted
- customer reviews
- website
- blog posts
- social networking sites
- natural language