FELM: Benchmarking Factuality Evaluation of Large Language Models.

Shiqi Chen Yiran Zhao Jinghan Zhang I-Chun Chern Siyang Gao Pengfei Liu Junxian He

Published in: CoRR (2023)

Keyphrases

language model
language modeling
document retrieval
probabilistic model
n gram
speech recognition
test collection
information retrieval
context sensitive
retrieval model
language modelling
query expansion
statistical language models
document ranking
query terms
vector space model
pseudo relevance feedback
smoothing methods
language model for information retrieval
language models for information retrieval
document length
retrieval effectiveness
translation model
passage retrieval
relevance assessments
relevance model
machine learning