N-gram and Word2Vec Feature Engineering Approaches for Spam Recognition on Some Influential Twitter Topics in Saudi Arabia.
Ahmed M. BalfagihVlado KeseljStacey TaylorPublished in: ICISDM (2022)
Keyphrases
- n gram
- text classification
- feature engineering
- language model
- saudi arabia
- language independent
- language modeling
- variable length
- word segmentation
- word level
- word recognition
- language specific
- viterbi algorithm
- information retrieval
- character n grams
- text categorization
- web documents
- multimedia
- machine learning
- learning process
- document analysis
- dependency parsing
- artificial intelligence
- keywords
- natural language
- social media
- active learning