Charagram: Embedding Words and Sentences via Character n-grams.
John Wieting, Mohit Bansal, Kevin Gimpel, Karen Livescu
Published in: EMNLP (2016)
Keyphrases
- character n grams
- n gram
- variable length
- cross language information retrieval
- cross language
- language model
- language specific
- natural language
- optical character recognition
- language independent
- multiword
- arabic documents
- word segmentation
- document level
- cross lingual
- text retrieval
- sentence level
- linguistic features
- semantic roles
- bag of words
- text classification
- human motion
- query translation
- text documents
- out of vocabulary