Crowdsourcing High-Quality Parallel Data Extraction from Twitter.
Wang Ling, Luís Marujo, Chris Dyer, Alan W. Black, Isabel Trancoso
Published in: WMT@ACL (2014)
Keyphrases
- data extraction
- high quality
- semi-structured
- web data extraction
- data integration
- mechanical turk
- web sources
- social media
- social networks
- information extraction
- databases
- web pages
- online social networks
- html pages
- social networking
- structured data
- distributed databases
- user queries
- high-dimensional
- decision making
- machine learning
- real world