byteSteady: Fast Classification Using Byte-Level n-Gram Embeddings.
Xiang ZhangAlexandre DrouinRaymond LiPublished in: CoRR (2021)
Keyphrases
- n gram
- text classification
- language model
- classification accuracy
- decision trees
- variable length
- language modeling
- language modelling
- feature selection
- support vector
- word segmentation
- image classification
- bag of words
- language independent
- feature vectors
- text categorization
- word level
- part of speech
- neural network
- support vector machine
- feature space
- machine learning
- information extraction
- hidden markov models
- data analysis
- viterbi algorithm
- databases
- inside outside algorithm