RoBERTuito: a pre-trained language model for social media text in Spanish.
Juan Manuel PérezDamián Ariel FurmanLaura Alonso AlemanyFranco LuquePublished in: CoRR (2021)
Keyphrases
- language model
- pre trained
- social media
- information retrieval
- language modeling
- document retrieval
- speech recognition
- probabilistic model
- retrieval model
- n gram
- query expansion
- text retrieval
- test collection
- multiword
- text mining
- context sensitive
- mixture model
- smoothing methods
- text documents
- training examples
- training data
- question answering
- neural network
- keywords
- control signals
- translation model
- co occurrence
- support vector