​
Login / Signup
Haoning Ye
Publication Activity (10 Years)
Years Active: 2023-2024
Publications (10 Years): 6
Top Topics
Language Model
Evaluation Process
Domain Knowledge
N Gram
Top Venues
CoRR
AAAI
</>
Publications
</>
Zhouhong Gu
,
Haoning Ye
,
Zeyang Zhou
,
Hongwei Feng
,
Yanghua Xiao
StrucText-Eval: An Autogenerated Benchmark for Evaluating Large Language Model's Ability in Structure-Rich Text Understanding.
CoRR
(2024)
Zhouhong Gu
,
Xiaoxuan Zhu
,
Haoning Ye
,
Lin Zhang
,
Jianchen Wang
,
Yixin Zhu
,
Sihang Jiang
,
Zhuozhi Xiong
,
Zihan Li
,
Weijie Wu
,
Qianyu He
,
Rui Xu
,
Wenhao Huang
,
Jingping Liu
,
Zili Wang
,
Shusen Wang
,
Weiguo Zheng
,
Hongwei Feng
,
Yanghua Xiao
Xiezhi: An Ever-Updating Benchmark for Holistic Domain Knowledge Evaluation.
AAAI
(2024)
Zhouhong Gu
,
Zihan Li
,
Lin Zhang
,
Zhuozhi Xiong
,
Sihang Jiang
,
Xiaoxuan Zhu
,
Shusen Wang
,
Zili Wang
,
Jianchen Wang
,
Haoning Ye
,
Wenhao Huang
,
Yikai Zhang
,
Hongwei Feng
,
Yanghua Xiao
Beyond the Obvious: Evaluating the Reasoning Ability In Real-life Scenarios of Language Models on Life Scapes Reasoning Benchmark~(LSR-Benchmark).
CoRR
(2023)
Zhouhong Gu
,
Xiaoxuan Zhu
,
Haoning Ye
,
Lin Zhang
,
Jianchen Wang
,
Sihang Jiang
,
Zhuozhi Xiong
,
Zihan Li
,
Qianyu He
,
Rui Xu
,
Wenhao Huang
,
Weiguo Zheng
,
Hongwei Feng
,
Yanghua Xiao
Xiezhi: An Ever-Updating Benchmark for Holistic Domain Knowledge Evaluation.
CoRR
(2023)
Zhouhong Gu
,
Xiaoxuan Zhu
,
Haoning Ye
,
Lin Zhang
,
Zhuozhi Xiong
,
Zihan Li
,
Qianyu He
,
Sihang Jiang
,
Hongwei Feng
,
Yanghua Xiao
Domain Mastery Benchmark: An Ever-Updating Benchmark for Evaluating Holistic Domain Knowledge of Large Language Model-A Preliminary Release.
CoRR
(2023)
Qianyu He
,
Jie Zeng
,
Wenhao Huang
,
Lina Chen
,
Jin Xiao
,
Qianxi He
,
Xunzhe Zhou
,
Lida Chen
,
Xintao Wang
,
Yuncheng Huang
,
Haoning Ye
,
Zihan Li
,
Shisong Chen
,
Yikai Zhang
,
Zhouhong Gu
,
Jiaqing Liang
,
Yanghua Xiao
Can Large Language Models Understand Real-World Complex Instructions?
CoRR
(2023)