jeinlee1991/chinese-llm-benchmark issues and pull requests

#43 - 纯粹搞笑的评测, 收了百度多少钱?

Issue - State: open - Opened by a5185330 2 days ago

#42 - 可否评测一下stepfun的系列模型

Issue - State: open - Opened by forrestlinfeng 12 days ago

#41 - 可以增加llama3.1评测数据吗

Issue - State: closed - Opened by Anionex about 1 month ago - 3 comments

#41 - 可以增加llama3.1评测数据吗

Issue - State: closed - Opened by Anionex about 1 month ago - 3 comments

#40 - 能不能对各能力做一个详细的解释啊？

Issue - State: open - Opened by Wooden-Gear 3 months ago

#40 - 能不能对各能力做一个详细的解释啊？

Issue - State: open - Opened by Wooden-Gear 3 months ago

#39 - 开个 Nemotron-4 340B 评价

Issue - State: open - Opened by wrench1997 3 months ago

#38 - 新增Yi-1.5系列模型的数据

Issue - State: closed - Opened by zzc0208 4 months ago - 1 comment

#37 - 10B以下的LLM排名不太准确，实际使用ChatGLM3-6B和Qwen1.5-7B表现更好

Issue - State: open - Opened by danny-zhu 4 months ago - 2 comments

#36 - 评测一下 deepseek v2

Issue - State: closed - Opened by cubxxw 4 months ago - 1 comment

#36 - 评测一下 deepseek v2

Issue - State: closed - Opened by cubxxw 4 months ago - 1 comment

#35 - 评测数据无法吐槽

Issue - State: open - Opened by freedomRen 5 months ago - 2 comments

#34 - 10b以下开源排名榜单不靠谱

Issue - State: open - Opened by wyfSunflower 5 months ago

#34 - 10b以下开源排名榜单不靠谱

Issue - State: open - Opened by wyfSunflower 5 months ago

#33 - 缺少重要的claude系列，申请加入相关测评

Issue - State: open - Opened by chiguabaobao 6 months ago - 2 comments

#32 - 能否加入qianwen1.5-32B的评测

Issue - State: closed - Opened by yu-zheng-tao 6 months ago - 2 comments

#32 - 能否加入qianwen1.5-32B的评测

Issue - State: closed - Opened by yu-zheng-tao 6 months ago - 2 comments

#31 - 能否加入Function Call（工具调用）能力指标评测

Issue - State: open - Opened by Dream-s-Wang 6 months ago - 1 comment

#30 - 讯飞星火13B开源模型测评

Issue - State: open - Opened by STHSF 6 months ago

#29 - 可否增加claude3商用模型的评测

Issue - State: open - Opened by yu-zheng-tao 6 months ago

#28 - 为什么千问1.5-14B-chat分这么高，比72b还高？

Issue - State: closed - Opened by yu-zheng-tao 6 months ago - 4 comments

#27 - 为什么千问1.5-14B-chat分这么高，比72b还高？

Issue - State: closed - Opened by yu-zheng-tao 6 months ago

#26 - 可否将kimi chat加入榜单

Issue - State: closed - Opened by LengmoAngel 7 months ago - 1 comment

#25 - 建议增加1B模型测试

Issue - State: closed - Opened by yuys0602 7 months ago - 1 comment

#24 - 讯飞星火推出3.5版本

Issue - State: closed - Opened by zhisuyan 8 months ago - 1 comment

#23 - Is there any arxiv paper or report for this benchmark?

Issue - State: open - Opened by zhimin-z 8 months ago

#22 - update new model

Issue - State: closed - Opened by zzc0208 8 months ago

#21 - 可以测试一下openbuddy-deepseek-67b-v15.2

Issue - State: closed - Opened by openmynet 9 months ago - 1 comment

#20 - 文心一言的新版本复测

Issue - State: closed - Opened by huanghuanhuahuh 10 months ago - 1 comment

#19 - What is the evaluation criteria for the score?

Issue - State: open - Opened by zhimin-z 10 months ago

#18 - This link does not redirect...

Issue - State: open - Opened by zhimin-z 10 months ago

#17 - Why does data analysis evaluation not count into the overall score?

Issue - State: open - Opened by zhimin-z 10 months ago

#16 - 强烈建议加入moonshot的Kimi chat！！！

Issue - State: closed - Opened by witherlll 10 months ago - 2 comments

#15 - 我Claude呢？

Issue - State: open - Opened by JiangKaslana about 1 year ago

#14 - 评测数据太少了吧，这能说明问题？

Issue - State: open - Opened by yyl424525 about 1 year ago - 1 comment

#13 - How should I cite this work?

Issue - State: open - Opened by g-h-chen about 1 year ago

#12 - 如果有各个模型的部署硬件要求对比就好了

Issue - State: open - Opened by zhangmianhongni about 1 year ago

#11 - 可以评测一下Chinese-LLaMA-Alpaca-2吗

Issue - State: open - Opened by dodogreen about 1 year ago

#10 - 可以评测一下千问-7B模型吗

Issue - State: closed - Opened by liudayiheng about 1 year ago

#9 - 很棒的测评，请问项目主测试数据可以转载吗

Issue - State: closed - Opened by l269438 about 1 year ago - 1 comment

#8 - 通义千问的评测时间？

Issue - State: closed - Opened by liudayiheng about 1 year ago

#7 - 很好的工作，不知道未来有将Anima-30B模型列入评测计划么？

Issue - State: open - Opened by UI233 about 1 year ago

#6 - 希望能够增加RWKV模型进行评测

Issue - State: open - Opened by OopsYouDiedE about 1 year ago - 3 comments

#5 - 提供结果复现代码

Issue - State: open - Opened by azmat21 about 1 year ago

#4 - 如何提交自己的模型进行评测？

Issue - State: open - Opened by Taoooo9 about 1 year ago - 1 comment

#3 - eval中是所有评测数据吗

Issue - State: closed - Opened by TTCoding over 1 year ago - 1 comment

#2 - 很棒的工作，请问评分标准是怎么样的呢？是如何给这些模型打分的？

Issue - State: open - Opened by wwngh1233 over 1 year ago - 7 comments

#1 - 请问为什么没有bing？

Issue - State: closed - Opened by tutianyu101 over 1 year ago - 1 comment

GitHub / jeinlee1991/chinese-llm-benchmark issues and pull requests