Token-wise Influential Training Data Retrieval for Large Language Models.
Huawei LinJikai LongZhaozhuo XuWeijie ZhaoPublished in: ACL (1) (2024)
Keyphrases
- language model
- data retrieval
- information retrieval
- language modeling
- document retrieval
- n gram
- databases
- language modelling
- probabilistic model
- data access
- retrieval model
- test collection
- speech recognition
- query expansion
- statistical language models
- query terms
- query processing
- translation model
- document ranking
- smoothing methods
- xml databases
- language models for information retrieval
- relevance model
- search engine
- pseudo relevance feedback
- information retrieval systems
- co occurrence
- text mining
- metadata
- machine learning