Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / jeinlee1991/chinese-llm-benchmark issues and pull requests
#43 - 纯粹搞笑的评测, 收了百度多少钱?
Issue -
State: open - Opened by a5185330 2 days ago
#42 - 可否评测一下stepfun的系列模型
Issue -
State: open - Opened by forrestlinfeng 12 days ago
#41 - 可以增加llama3.1评测数据吗
Issue -
State: closed - Opened by Anionex about 1 month ago
- 3 comments
#41 - 可以增加llama3.1评测数据吗
Issue -
State: closed - Opened by Anionex about 1 month ago
- 3 comments
#40 - 能不能对各能力做一个详细的解释啊?
Issue -
State: open - Opened by Wooden-Gear 3 months ago
#40 - 能不能对各能力做一个详细的解释啊?
Issue -
State: open - Opened by Wooden-Gear 3 months ago
#39 - 开个 Nemotron-4 340B 评价
Issue -
State: open - Opened by wrench1997 3 months ago
#38 - 新增Yi-1.5系列模型的数据
Issue -
State: closed - Opened by zzc0208 4 months ago
- 1 comment
#37 - 10B以下的LLM排名不太准确,实际使用ChatGLM3-6B和Qwen1.5-7B表现更好
Issue -
State: open - Opened by danny-zhu 4 months ago
- 2 comments
#36 - 评测一下 deepseek v2
Issue -
State: closed - Opened by cubxxw 4 months ago
- 1 comment
#36 - 评测一下 deepseek v2
Issue -
State: closed - Opened by cubxxw 4 months ago
- 1 comment
#35 - 评测数据无法吐槽
Issue -
State: open - Opened by freedomRen 5 months ago
- 2 comments
#34 - 10b以下开源排名榜单不靠谱
Issue -
State: open - Opened by wyfSunflower 5 months ago
#34 - 10b以下开源排名榜单不靠谱
Issue -
State: open - Opened by wyfSunflower 5 months ago
#33 - 缺少重要的claude系列,申请加入相关测评
Issue -
State: open - Opened by chiguabaobao 6 months ago
- 2 comments
#32 - 能否加入qianwen1.5-32B的评测
Issue -
State: closed - Opened by yu-zheng-tao 6 months ago
- 2 comments
#32 - 能否加入qianwen1.5-32B的评测
Issue -
State: closed - Opened by yu-zheng-tao 6 months ago
- 2 comments
#31 - 能否加入Function Call(工具调用)能力指标评测
Issue -
State: open - Opened by Dream-s-Wang 6 months ago
- 1 comment
#30 - 讯飞星火13B开源模型测评
Issue -
State: open - Opened by STHSF 6 months ago
#29 - 可否增加claude3商用模型的评测
Issue -
State: open - Opened by yu-zheng-tao 6 months ago
#28 - 为什么千问1.5-14B-chat分这么高,比72b还高?
Issue -
State: closed - Opened by yu-zheng-tao 6 months ago
- 4 comments
#27 - 为什么千问1.5-14B-chat分这么高,比72b还高?
Issue -
State: closed - Opened by yu-zheng-tao 6 months ago
#26 - 可否将kimi chat加入榜单
Issue -
State: closed - Opened by LengmoAngel 7 months ago
- 1 comment
#25 - 建议增加1B模型测试
Issue -
State: closed - Opened by yuys0602 7 months ago
- 1 comment
#24 - 讯飞星火推出3.5版本
Issue -
State: closed - Opened by zhisuyan 8 months ago
- 1 comment
#23 - Is there any arxiv paper or report for this benchmark?
Issue -
State: open - Opened by zhimin-z 8 months ago
#22 - update new model
Issue -
State: closed - Opened by zzc0208 8 months ago
#21 - 可以测试一下openbuddy-deepseek-67b-v15.2
Issue -
State: closed - Opened by openmynet 9 months ago
- 1 comment
#20 - 文心一言的新版本复测
Issue -
State: closed - Opened by huanghuanhuahuh 10 months ago
- 1 comment
#19 - What is the evaluation criteria for the score?
Issue -
State: open - Opened by zhimin-z 10 months ago
#18 - This link does not redirect...
Issue -
State: open - Opened by zhimin-z 10 months ago
#17 - Why does data analysis evaluation not count into the overall score?
Issue -
State: open - Opened by zhimin-z 10 months ago
#16 - 强烈建议加入moonshot的Kimi chat!!!
Issue -
State: closed - Opened by witherlll 10 months ago
- 2 comments
#15 - 我Claude呢?
Issue -
State: open - Opened by JiangKaslana about 1 year ago
#14 - 评测数据太少了吧,这能说明问题?
Issue -
State: open - Opened by yyl424525 about 1 year ago
- 1 comment
#13 - How should I cite this work?
Issue -
State: open - Opened by g-h-chen about 1 year ago
#12 - 如果有各个模型的部署硬件要求对比就好了
Issue -
State: open - Opened by zhangmianhongni about 1 year ago
#11 - 可以评测一下Chinese-LLaMA-Alpaca-2吗
Issue -
State: open - Opened by dodogreen about 1 year ago
#10 - 可以评测一下千问-7B模型吗
Issue -
State: closed - Opened by liudayiheng about 1 year ago
#9 - 很棒的测评,请问项目主测试数据可以转载吗
Issue -
State: closed - Opened by l269438 about 1 year ago
- 1 comment
#8 - 通义千问的评测时间?
Issue -
State: closed - Opened by liudayiheng about 1 year ago
#7 - 很好的工作,不知道未来有将Anima-30B模型列入评测计划么?
Issue -
State: open - Opened by UI233 about 1 year ago
#6 - 希望能够增加RWKV模型进行评测
Issue -
State: open - Opened by OopsYouDiedE about 1 year ago
- 3 comments
#5 - 提供结果复现代码
Issue -
State: open - Opened by azmat21 about 1 year ago
#4 - 如何提交自己的模型进行评测?
Issue -
State: open - Opened by Taoooo9 about 1 year ago
- 1 comment
#3 - eval中是所有评测数据吗
Issue -
State: closed - Opened by TTCoding over 1 year ago
- 1 comment
#2 - 很棒的工作, 请问评分标准是怎么样的呢?是如何给这些模型打分的?
Issue -
State: open - Opened by wwngh1233 over 1 year ago
- 7 comments
#1 - 请问为什么没有bing?
Issue -
State: closed - Opened by tutianyu101 over 1 year ago
- 1 comment