TunBERT: Pretrained Contextualized Text Representation for Tunisian Dialect.
Abir MessaoudiAhmed CheikhrouhouHatem HaddadNourchene FerchichiMoez BenHajhmidaAbir KorchedMalek NaskiFaten GhrissAmine KerkeniPublished in: CoRR (2021)
Keyphrases
- text representation
- concept learning
- information filtering
- vector space model
- text classification
- text documents
- text categorization
- bag of words
- index terms
- text mining
- keywords
- document clustering
- text retrieval
- word sense disambiguation
- text clustering
- document representation
- image classification
- information retrieval
- information extraction
- data analysis
- retrieval systems
- co occurrence
- machine learning