LEGION: Harnessing Pre-trained Language Models for GitHub Topic Recommendations with Distribution-Balance Loss.
Yen-Trang Dang, Thanh Le-Cong, Phuc-Thanh Nguyen, Anh M. T. Bui, Phuong T. Nguyen, Bach Le, Quyet-Thang Huynh
Published in: CoRR (2024)
Keyphrases
- language model
- pre-trained
- language modeling
- n-gram
- document retrieval
- information retrieval
- retrieval model
- probabilistic model
- speech recognition
- language modelling
- expert finding
- smoothing methods
- training data
- query expansion
- statistical language models
- recommender systems
- training examples
- test collection
- document ranking
- language models for information retrieval
- relevance model
- data sets
- latent dirichlet allocation
- topic models
- multi-modal
- hidden markov models
- search engine
- control signals
- query topic
- machine learning
- neural network