FlauBERT : des modèles de langue contextualisés pré-entraînés pour le français (FlauBERT : Unsupervised Language Model Pre-training for French).
Hang LeLoïc VialJibril FrejVincent SegonneMaximin CoavouxBenjamin LecouteuxAlexandre AllauzenBenoît CrabbéLaurent BesacierDidier SchwabPublished in: JEP-TALN-RECITAL (2) (2020)
Keyphrases
- language model
- language modeling
- n gram
- supervised learning
- probabilistic model
- language modelling
- speech recognition
- retrieval model
- document retrieval
- context sensitive
- information retrieval
- test collection
- ad hoc information retrieval
- query expansion
- mixture model
- statistical language models
- document ranking
- query terms
- language model for information retrieval
- unsupervised learning
- statistical machine translation
- pseudo relevance feedback
- cross lingual
- vector space model
- training set
- relevance model
- active learning
- query specific
- semi supervised
- word clouds
- information retrieval systems
- search engine
- translation model
- knn
- smoothing methods
- machine learning
- co occurrence
- language models for information retrieval