BLOOM: A 176B-Parameter Open-Access Multilingual Language Model.
Teven Le ScaoAngela FanChristopher AkikiEllie PavlickSuzana IlicDaniel HesslowRoman CastagnéAlexandra Sasha LuccioniFrançois YvonMatthias GalléJonathan TowAlexander M. RushStella BidermanAlbert WebsonPawan Sasanka AmmanamanchiThomas WangBenoît SagotNiklas MuennighoffAlbert Villanova del MoralOlatunji RuwaseRachel BawdenStas BekmanAngelina McMillan-MajorIz BeltagyHuu NguyenLucile SaulnierSamson TanPedro Ortiz SuarezVictor SanhHugo LaurençonYacine JerniteJulien LaunayMargaret MitchellColin RaffelAaron GokaslanAdi SimhiAitor SoroaAlham Fikri AjiAmit AlfassyAnna RogersAriel Kreisberg NitzavCanwen XuChenghao MouChris EmezueChristopher KlammColin LeongDaniel van StrienDavid Ifeoluwa Adelaniet al.Published in: CoRR (2022)
Keyphrases
- language model
- open access
- language modeling
- cross lingual
- n gram
- document retrieval
- metadata
- probabilistic model
- information retrieval
- query expansion
- retrieval model
- speech recognition
- language independent
- language modelling
- mixture model
- dirichlet prior
- cross language
- ad hoc information retrieval
- relevance model
- context sensitive
- test collection
- statistical language models
- query terms
- smoothing methods
- digital libraries
- cross language information retrieval
- pseudo relevance feedback
- document length
- machine translation
- translation model
- health information
- text classification
- database