GitHub / codefuse-ai/mftcoder issues and pull requests

#88 - 运行accelerate launch --config_file accelerate_ds_config.yaml pefts/mft_accelerate.py --train_config configs/coba_train_config.json --distributed_type "DeepSpeed"报错

Issue - State: open - Opened by SYVAE 4 months ago

#87 - 数据集下载

Issue - State: open - Opened by wzw-nudt 4 months ago

#86 - 基于qwen模型使用coba训练后，权重合并错误

Issue - State: open - Opened by ChaohuaZhang 7 months ago - 1 comment

#85 - 请问CoBa论文中，训练参数是如何设置的

Issue - State: open - Opened by daidaiershidi 9 months ago - 1 comment

#84 - 0.5.dev pr

Pull Request - State: closed - Opened by chencyudel 9 months ago

#83 - update tutorial of CoBa arguments

Pull Request - State: closed - Opened by GoneZ5 9 months ago

#82 - update the tutorial of CoBa

Pull Request - State: closed - Opened by GoneZ5 9 months ago

#81 - Support coba loss

Pull Request - State: closed - Opened by GoneZ5 9 months ago

#80 - docs: add Japanese README

Pull Request - State: open - Opened by eltociear 9 months ago

#79 - [ModelCache]提供多种存储方案能力。

Issue - State: closed - Opened by peng3307165 10 months ago

#78 - 【MFTCoder】适配deepseek-v2

Issue - State: closed - Opened by wj882018 10 months ago

#77 - 【MFTCoder】撰写某领域MFT最佳实践tutorial

Issue - State: closed - Opened by wj882018 10 months ago

#76 - 【MFTCoder】增加usage example文档

Issue - State: closed - Opened by wj882018 10 months ago - 2 comments

#75 - 【MFTCoder】增加与其他微调工具框架（例如llama-factory）的对比结果文档

Issue - State: closed - Opened by wj882018 10 months ago

#74 - 【MFTCoder】尝试用更多其他的开源数据集，在mftcoder上进行实验，对比MFT和SFT效果

Issue - State: closed - Opened by wj882018 10 months ago

#73 - 【MFTCoder】在mftcoder框架下实现更多的mft算法

Issue - State: closed - Opened by wj882018 10 months ago

#72 - 【MFTCoder】在mftcoder框架下实现更多的mft算法

Issue - State: closed - Opened by wj882018 10 months ago

#71 - Update README_cn.md

Pull Request - State: closed - Opened by chencyudel 10 months ago

#70 - Update README_cn.md

Pull Request - State: closed - Opened by chencyudel 10 months ago

#69 - Update README_cn.md

Pull Request - State: closed - Opened by chencyudel 10 months ago

#68 - Add files via upload

Pull Request - State: closed - Opened by chencyudel 10 months ago

#67 - Add files via upload

Pull Request - State: closed - Opened by wj882018 10 months ago

#66 - Add files via upload

Pull Request - State: closed - Opened by wj882018 10 months ago

#65 - model type

Issue - State: open - Opened by XiaoMaGe-hero 12 months ago - 1 comment

#64 - 实验 MFTCoder 的效果总是不尽人意

Issue - State: open - Opened by Chaochao2020 about 1 year ago - 2 comments

#63 - Update requirements

Pull Request - State: closed - Opened by chencyudel about 1 year ago

#62 - readme

Pull Request - State: closed - Opened by chencyudel about 1 year ago

#61 - bugfix, remove default tensorboard writer to avoid permission issue

Pull Request - State: closed - Opened by chencyudel about 1 year ago

#60 - mftcoder 新版 Permission denied: '/home/admin' BUG

Issue - State: closed - Opened by Chaochao2020 about 1 year ago - 5 comments

#59 - mftcoder使用humaneval评估

Issue - State: open - Opened by lwh8915 about 1 year ago - 1 comment

#58 - V0.4.dev

Pull Request - State: closed - Opened by chencyudel about 1 year ago

#57 - RuntimeError: CUDA error: invalid device ordinal

Issue - State: closed - Opened by lwh8915 about 1 year ago - 1 comment

#56 - Update README.md

Pull Request - State: closed - Opened by twelveand0 about 1 year ago

#55 - 数据集loss 下降不均衡如何处理

Issue - State: closed - Opened by huangmenglong about 1 year ago - 1 comment

#54 - Update README_cn.md

Pull Request - State: closed - Opened by twelveand0 over 1 year ago

#53 - Update README.md

Pull Request - State: closed - Opened by twelveand0 over 1 year ago

#52 - convergence curves

Issue - State: closed - Opened by twelveand0 over 1 year ago

#51 - MFTCoder论文中训练数据集

Issue - State: closed - Opened by superqing001 over 1 year ago - 2 comments

#50 - Update README_cn.md to add Join-Us section

Pull Request - State: closed - Opened by twelveand0 over 1 year ago

#49 - Update README.md to add Join-Us section

Pull Request - State: closed - Opened by twelveand0 over 1 year ago

#48 - 任务的类型也是用gpt来生成的吗？

Issue - State: closed - Opened by shatealaboxiaowang over 1 year ago - 1 comment

#47 - How can i do continue pretraining?

Issue - State: open - Opened by hwaking over 1 year ago - 1 comment

#46 - Update MFTCoder chat template

Pull Request - State: closed - Opened by chencyudel over 1 year ago

#45 - 模型微调完，合并时报错 NotImplementedError: Cannot copy out of meta tensor; no data!

Issue - State: closed - Opened by xxyp over 1 year ago - 2 comments

#44 - Update README.md

Pull Request - State: closed - Opened by twelveand0 over 1 year ago

#43 - 请问多机训练需要怎么修改？

Issue - State: closed - Opened by jy00161yang over 1 year ago - 1 comment

#42 - qlora微调合并权重时出错

Issue - State: closed - Opened by fangzexian over 1 year ago - 4 comments

#41 - Update README.md

Pull Request - State: closed - Opened by chencyudel over 1 year ago

#40 - readme

Pull Request - State: closed - Opened by chencyudel over 1 year ago

#39 - change jpg

Pull Request - State: closed - Opened by chencyudel over 1 year ago

#38 - Update README.md

Pull Request - State: closed - Opened by jglee2046 over 1 year ago

#37 - V0.3.0 dev merge

Pull Request - State: closed - Opened by chencyudel over 1 year ago

#36 - 请问下是否支持Wandb或者Tensorboard

Issue - State: closed - Opened by pydaxing over 1 year ago - 1 comment

#35 - 请问要支持chatglm3-6b-base的话需要哪些更改

Issue - State: closed - Opened by kevindany over 1 year ago - 2 comments

#34 - 请教4int的gptq模型能不能进行lora微调

Issue - State: closed - Opened by wengyuan722 over 1 year ago - 4 comments

#33 - 模型是否支持商用

Issue - State: closed - Opened by zhangyukun230 over 1 year ago - 5 comments

#32 - ValueError: Asking to pad but the tokenizer does not have a padding token. Please select a token to use as `pad_token` `(tokenizer.pad_token = tokenizer.eos_token e.g.)` or add a new pad token via `tokenizer.add_special_tokens({'pad_token': '[PAD]'})`.

Issue - State: closed - Opened by sxsxsx over 1 year ago - 9 comments

#31 - nccl 报错了

Issue - State: closed - Opened by belle9217 over 1 year ago - 3 comments

#30 - no 7B model size?

Issue - State: closed - Opened by yiyepiaoling0715 over 1 year ago - 2 comments

#29 - json.decoder.JSONDecodeError: Expecting value: line 1 column 2 (char 1)

Issue - State: closed - Opened by sxsxsx over 1 year ago - 5 comments

#28 - 单卡v1000，微调报错

Issue - State: closed - Opened by sxsxsx over 1 year ago - 2 comments

#27 - Something wrong when run 'bash run_bash.sh'

Issue - State: closed - Opened by MaoYouSi over 1 year ago - 1 comment

#26 - 数据问题ValueError: data format not supported, please use prompt/answer, or chatML or pretrain text

Issue - State: closed - Opened by mst272 over 1 year ago

#25 - 代码中对于3.5 Multitask Fine-Tuning with Balanced Losses的具体实现的位置（只找到了第一个loss的实现）

Issue - State: closed - Opened by YanqiDai over 1 year ago - 2 comments

#24 - NotImplementedError: Cannot copy out of meta tensor; no data!

Issue - State: closed - Opened by zzb2019053515 over 1 year ago - 1 comment

#23 - 如何构建codefuse-llamacode的提问和终止符

Issue - State: closed - Opened by wengyuan722 over 1 year ago - 29 comments

#22 - loss计算那里 RuntimeError: element 0 of tensors does not require grad and does not have a grad_fn

Issue - State: closed - Opened by hhy150 over 1 year ago - 3 comments

#21 - little bug fix meet

Issue - State: closed - Opened by elcky over 1 year ago - 2 comments

#20 - safetensors_rust.SafetensorError: Error while deserializing header: HeaderTooLarge

Issue - State: closed - Opened by zzb2019053515 over 1 year ago - 3 comments

#19 - 模型训练没有进度条

Issue - State: closed - Opened by liujingqiao over 1 year ago

#18 - 在codellama上微调的性能没有提升

Issue - State: closed - Opened by HPRCEST over 1 year ago - 2 comments

#17 - Update README.md

Pull Request - State: closed - Opened by huybery over 1 year ago

#16 - 请问，对模型进行多任务微调该怎么设计jsonl数据集？

Issue - State: closed - Opened by a793181018 over 1 year ago - 5 comments

#15 - Inquiry about weighted_loss_mode

Issue - State: closed - Opened by tszdanger over 1 year ago - 1 comment

#14 - 请问FSDP的训练API啥时候会开源出来

Issue - State: closed - Opened by peiji1981 over 1 year ago - 1 comment

#13 - 麻烦我想问下一个可行性问题，对CodeFuse-CodeGeeX2-6B进行微调时是否可以使用peft的方式中chatglm2 config进行微调？万分感谢🙏

Issue - State: closed - Opened by whyPeanutbutter over 1 year ago - 1 comment

#12 - readme.txt指出，训练数据为jsonl格式，参考项目中的xxx.jsonl文件。未搜到对应的参考jsonl文件，能否麻烦给出一个示例？谢谢🙏

Issue - State: closed - Opened by whyPeanutbutter over 1 year ago - 2 comments

#11 - data.helper 无法加载？

Issue - State: closed - Opened by liudonglei over 1 year ago - 4 comments

#10 - about focal loss mentioned in the paper

Issue - State: closed - Opened by iDonal over 1 year ago - 1 comment

#9 - 能否写一个完整的微调例子？

Issue - State: closed - Opened by liudonglei over 1 year ago - 1 comment

#8 - 使用lora + zero3微调CodeFuse-CodeLlama-34B后，合并模型失败

Issue - State: closed - Opened by 3m123 over 1 year ago - 3 comments

#7 - MFTCoder微调codefuse34b模型后，发现模型代码补全这块的回复能力就没了，求解决方案

Issue - State: closed - Opened by yangyubin1 over 1 year ago - 3 comments

#6 - 基于chatgpt生成的高质量python练习题数据是如何获取呀

Issue - State: closed - Opened by 18liumin almost 2 years ago - 1 comment

#5 - Update README_cn.md

Pull Request - State: closed - Opened by wj882018 almost 2 years ago

#4 - Update README.md

Pull Request - State: closed - Opened by wj882018 almost 2 years ago

#3 - HumanEval测试的Pass@1不高

Issue - State: closed - Opened by wangzhao88 almost 2 years ago - 2 comments

#2 - 国内下载方式

Issue - State: closed - Opened by wuyihz almost 2 years ago - 1 comment

#1 - 训练数据包含中文数据吗

Issue - State: closed - Opened by smashfan almost 2 years ago - 1 comment