Collecting and Annotating Indian Social Media Code-Mixed Corpora.
Anupam Jamatia, Björn Gambäck, Amitava Das
Published in: CICLing (2) (2016)
Keyphrases
- social media
- source code
- social networks
- social networking
- metadata
- natural language processing
- data collection
- user generated content
- social media data
- big data
- social media platforms
- machine learning
- java programs
- social media sites
- code generation
- software industry
- manual annotation
- semantic annotation
- information systems
- search engine