Ranking of Large Language Models' Hallucination Control Ability in Chinese-language Contexts
by Zhenhui(Jack) Jiang1, Yi
Lu1, Yifan Wu1, Haozhe
Xu2, Zhengyu Wu1, Jiaxin
Li1 /
蒋镇辉1,鲁艺1,吴轶凡1,徐昊哲2,武正昱1,李佳欣1
1HKU Business
School,2The School of Management,
Xi’an Jiaotong University
The full report can be accessed
HERE.
Leaderboard
|
Rank
|
Model
Name
|
Factual
Hallucination
|
Faithful
Hallucination
|
Final
Score
|
|---|---|---|---|---|
|
1
|
GPT
5(Thinking)
|
72
|
100
|
86
|