TunBERT: Pretrained Contextualized Text Representation for Tunisian Dialect.
Abir MessaoudiAhmed CheikhrouhouHatem HaddadNourchene FerchichiMoez BenHajhmidaAbir KorchedMalek NaskiFaten GhrissAmine KerkeniPublished in: ISPR (2022)
Keyphrases
- text representation
- information filtering
- concept learning
- bag of words
- index terms
- text categorization
- vector space model
- text classification
- text documents
- keywords
- document clustering
- text retrieval
- text mining
- document representation
- word sense disambiguation
- information extraction
- text clustering
- vector space
- co occurrence
- domain knowledge
- multiscale