An open API service for providing issue and pull request metadata for open source projects.

GitHub / codefuse-ai/mftcoder issues and pull requests

#87 - 数据集下载

Issue - State: open - Opened by wzw-nudt 4 months ago

#86 - 基于qwen模型使用coba训练后,权重合并错误

Issue - State: open - Opened by ChaohuaZhang 7 months ago - 1 comment

#85 - 请问CoBa论文中,训练参数是如何设置的

Issue - State: open - Opened by daidaiershidi 9 months ago - 1 comment

#84 - 0.5.dev pr

Pull Request - State: closed - Opened by chencyudel 9 months ago

#83 - update tutorial of CoBa arguments

Pull Request - State: closed - Opened by GoneZ5 9 months ago

#82 - update the tutorial of CoBa

Pull Request - State: closed - Opened by GoneZ5 9 months ago

#81 - Support coba loss

Pull Request - State: closed - Opened by GoneZ5 9 months ago

#80 - docs: add Japanese README

Pull Request - State: open - Opened by eltociear 9 months ago

#79 - [ModelCache]提供多种存储方案能力。

Issue - State: closed - Opened by peng3307165 10 months ago

#78 - 【MFTCoder】适配deepseek-v2

Issue - State: closed - Opened by wj882018 10 months ago

#77 - 【MFTCoder】撰写某领域MFT最佳实践tutorial

Issue - State: closed - Opened by wj882018 10 months ago

#76 - 【MFTCoder】增加usage example文档

Issue - State: closed - Opened by wj882018 10 months ago - 2 comments

#71 - Update README_cn.md

Pull Request - State: closed - Opened by chencyudel 10 months ago

#70 - Update README_cn.md

Pull Request - State: closed - Opened by chencyudel 10 months ago

#69 - Update README_cn.md

Pull Request - State: closed - Opened by chencyudel 10 months ago

#68 - Add files via upload

Pull Request - State: closed - Opened by chencyudel 10 months ago

#67 - Add files via upload

Pull Request - State: closed - Opened by wj882018 10 months ago

#66 - Add files via upload

Pull Request - State: closed - Opened by wj882018 10 months ago

#65 - model type

Issue - State: open - Opened by XiaoMaGe-hero 12 months ago - 1 comment

#64 - 实验 MFTCoder 的效果总是不尽人意

Issue - State: open - Opened by Chaochao2020 about 1 year ago - 2 comments

#63 - Update requirements

Pull Request - State: closed - Opened by chencyudel about 1 year ago

#62 - readme

Pull Request - State: closed - Opened by chencyudel about 1 year ago

#61 - bugfix, remove default tensorboard writer to avoid permission issue

Pull Request - State: closed - Opened by chencyudel about 1 year ago

#60 - mftcoder 新版 Permission denied: '/home/admin' BUG

Issue - State: closed - Opened by Chaochao2020 about 1 year ago - 5 comments

#59 - mftcoder使用humaneval评估

Issue - State: open - Opened by lwh8915 about 1 year ago - 1 comment

#58 - V0.4.dev

Pull Request - State: closed - Opened by chencyudel about 1 year ago

#57 - RuntimeError: CUDA error: invalid device ordinal

Issue - State: closed - Opened by lwh8915 about 1 year ago - 1 comment

#56 - Update README.md

Pull Request - State: closed - Opened by twelveand0 about 1 year ago

#55 - 数据集loss 下降不均衡如何处理

Issue - State: closed - Opened by huangmenglong about 1 year ago - 1 comment

#54 - Update README_cn.md

Pull Request - State: closed - Opened by twelveand0 over 1 year ago

#53 - Update README.md

Pull Request - State: closed - Opened by twelveand0 over 1 year ago

#52 - convergence curves

Issue - State: closed - Opened by twelveand0 over 1 year ago

#51 - MFTCoder论文中训练数据集

Issue - State: closed - Opened by superqing001 over 1 year ago - 2 comments

#50 - Update README_cn.md to add Join-Us section

Pull Request - State: closed - Opened by twelveand0 over 1 year ago

#49 - Update README.md to add Join-Us section

Pull Request - State: closed - Opened by twelveand0 over 1 year ago

#48 - 任务的类型也是用gpt来生成的吗?

Issue - State: closed - Opened by shatealaboxiaowang over 1 year ago - 1 comment

#47 - How can i do continue pretraining?

Issue - State: open - Opened by hwaking over 1 year ago - 1 comment

#46 - Update MFTCoder chat template

Pull Request - State: closed - Opened by chencyudel over 1 year ago

#44 - Update README.md

Pull Request - State: closed - Opened by twelveand0 over 1 year ago

#43 - 请问多机训练需要怎么修改?

Issue - State: closed - Opened by jy00161yang over 1 year ago - 1 comment

#42 - qlora微调合并权重时出错

Issue - State: closed - Opened by fangzexian over 1 year ago - 4 comments

#41 - Update README.md

Pull Request - State: closed - Opened by chencyudel over 1 year ago

#40 - readme

Pull Request - State: closed - Opened by chencyudel over 1 year ago

#39 - change jpg

Pull Request - State: closed - Opened by chencyudel over 1 year ago

#38 - Update README.md

Pull Request - State: closed - Opened by jglee2046 over 1 year ago

#37 - V0.3.0 dev merge

Pull Request - State: closed - Opened by chencyudel over 1 year ago

#36 - 请问下是否支持Wandb或者Tensorboard

Issue - State: closed - Opened by pydaxing over 1 year ago - 1 comment

#35 - 请问要支持chatglm3-6b-base的话需要哪些更改

Issue - State: closed - Opened by kevindany over 1 year ago - 2 comments

#34 - 请教4int的gptq模型能不能进行lora微调

Issue - State: closed - Opened by wengyuan722 over 1 year ago - 4 comments

#33 - 模型是否支持商用

Issue - State: closed - Opened by zhangyukun230 over 1 year ago - 5 comments

#31 - nccl 报错了

Issue - State: closed - Opened by belle9217 over 1 year ago - 3 comments

#30 - no 7B model size?

Issue - State: closed - Opened by yiyepiaoling0715 over 1 year ago - 2 comments

#29 - json.decoder.JSONDecodeError: Expecting value: line 1 column 2 (char 1)

Issue - State: closed - Opened by sxsxsx over 1 year ago - 5 comments

#28 - 单卡v1000,微调报错

Issue - State: closed - Opened by sxsxsx over 1 year ago - 2 comments

#27 - Something wrong when run 'bash run_bash.sh'

Issue - State: closed - Opened by MaoYouSi over 1 year ago - 1 comment

#24 - NotImplementedError: Cannot copy out of meta tensor; no data!

Issue - State: closed - Opened by zzb2019053515 over 1 year ago - 1 comment

#23 - 如何构建codefuse-llamacode的提问和终止符

Issue - State: closed - Opened by wengyuan722 over 1 year ago - 29 comments

#21 - little bug fix meet

Issue - State: closed - Opened by elcky over 1 year ago - 2 comments

#19 - 模型训练没有进度条

Issue - State: closed - Opened by liujingqiao over 1 year ago

#18 - 在codellama上微调的性能没有提升

Issue - State: closed - Opened by HPRCEST over 1 year ago - 2 comments

#17 - Update README.md

Pull Request - State: closed - Opened by huybery over 1 year ago

#16 - 请问,对模型进行多任务微调该怎么设计jsonl数据集?

Issue - State: closed - Opened by a793181018 over 1 year ago - 5 comments

#15 - Inquiry about weighted_loss_mode

Issue - State: closed - Opened by tszdanger over 1 year ago - 1 comment

#14 - 请问FSDP的训练API啥时候会开源出来

Issue - State: closed - Opened by peiji1981 over 1 year ago - 1 comment

#11 - data.helper 无法加载?

Issue - State: closed - Opened by liudonglei over 1 year ago - 4 comments

#10 - about focal loss mentioned in the paper

Issue - State: closed - Opened by iDonal over 1 year ago - 1 comment

#9 - 能否写一个完整的微调例子?

Issue - State: closed - Opened by liudonglei over 1 year ago - 1 comment

#8 - 使用lora + zero3微调CodeFuse-CodeLlama-34B后,合并模型失败

Issue - State: closed - Opened by 3m123 over 1 year ago - 3 comments

#6 - 基于chatgpt生成的高质量python练习题数据是如何获取呀

Issue - State: closed - Opened by 18liumin almost 2 years ago - 1 comment

#5 - Update README_cn.md

Pull Request - State: closed - Opened by wj882018 almost 2 years ago

#4 - Update README.md

Pull Request - State: closed - Opened by wj882018 almost 2 years ago

#3 - HumanEval测试的Pass@1不高

Issue - State: closed - Opened by wangzhao88 almost 2 years ago - 2 comments

#2 - 国内下载方式

Issue - State: closed - Opened by wuyihz almost 2 years ago - 1 comment

#1 - 训练数据包含中文数据吗

Issue - State: closed - Opened by smashfan almost 2 years ago - 1 comment