ZEN 2.0: Continue Training and Adaption for N-gram Enhanced Text Encoders.
Yan SongTong ZhangYonggang WangKai-Fu LeePublished in: CoRR (2021)
Keyphrases
- n gram
- language model
- character n grams
- text classification
- language independent
- word level
- bag of words
- web documents
- variable length
- word segmentation
- language modelling
- text documents
- language modeling
- part of speech
- text mining
- viterbi algorithm
- language specific
- information retrieval
- text retrieval
- neural network
- text classifiers
- keywords
- document analysis
- co occurrence