Aggression-annotated Corpus of Hindi-English Code-mixed Data.
Ritesh Kumar, Aishwarya N. Reganti, Akshit Bhatia, Tushar Maheshwari
Published in: LREC (2018)
Keyphrases
- mixed data
- annotated corpus
- named entity recognition
- named entities
- information extraction
- machine translation
- natural language processing
- relation extraction
- data compression
- knn
- natural language
- conditional random fields
- maximum entropy
- indian languages
- data sets
- semi-supervised
- statistical machine translation
- similarity function
- automatic annotation
- clustering algorithm
- cross-lingual
- cross language information retrieval
- multiscale
- high dimensional data
- unsupervised learning
- training data
- support vector machine