Aggression-annotated Corpus of Hindi-English Code-mixed Data.
Ritesh KumarAishwarya N. RegantiAkshit BhatiaTushar MaheshwariPublished in: CoRR (2018)
Keyphrases
- mixed data
- annotated corpus
- named entity recognition
- named entities
- machine translation
- information extraction
- natural language processing
- relation extraction
- data sets
- data compression
- maximum entropy
- automatic annotation
- natural language
- semi supervised
- knn
- statistical machine translation
- indian languages
- conditional random fields
- similarity function
- cross lingual
- clustering algorithm
- image segmentation
- similarity measure
- text mining
- artificial intelligence
- weakly supervised
- domain specific
- hidden markov models