Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / codefuse-ai/mftcoder issues and pull requests
#84 - 0.5.dev pr
Pull Request -
State: closed - Opened by chencyudel 14 days ago
#83 - update tutorial of CoBa arguments
Pull Request -
State: closed - Opened by GoneZ5 15 days ago
#82 - update the tutorial of CoBa
Pull Request -
State: closed - Opened by GoneZ5 16 days ago
#81 - Support coba loss
Pull Request -
State: closed - Opened by GoneZ5 16 days ago
#80 - docs: add Japanese README
Pull Request -
State: open - Opened by eltociear 26 days ago
#79 - [ModelCache] Provide support for multiple storage options.
Issue -
State: closed - Opened by peng3307165 28 days ago
#78 - [MFTCoder] Adapt to deepseek-v2
Issue -
State: open - Opened by wj882018 28 days ago
#77 - [MFTCoder] Write a best-practice MFT tutorial for a specific domain
Issue -
State: open - Opened by wj882018 28 days ago
#76 - [MFTCoder] Add usage example documentation
Issue -
State: open - Opened by wj882018 28 days ago
- 1 comment
#75 - [MFTCoder] Add documentation comparing results with other fine-tuning frameworks (e.g. llama-factory)
Issue -
State: open - Opened by wj882018 28 days ago
#74 - [MFTCoder] Run experiments on mftcoder with more open-source datasets to compare MFT and SFT results
Issue -
State: open - Opened by wj882018 28 days ago
#73 - Implement more MFT algorithms in the mftcoder framework
Issue -
State: open - Opened by wj882018 28 days ago
#72 - [MFTCoder] Implement more MFT algorithms in the mftcoder framework
Issue -
State: open - Opened by wj882018 28 days ago
#71 - Update README_cn.md
Pull Request -
State: closed - Opened by chencyudel 28 days ago
#70 - Update README_cn.md
Pull Request -
State: closed - Opened by chencyudel 28 days ago
#69 - Update README_cn.md
Pull Request -
State: closed - Opened by chencyudel 28 days ago
#68 - Add files via upload
Pull Request -
State: closed - Opened by chencyudel 28 days ago
#67 - Add files via upload
Pull Request -
State: closed - Opened by wj882018 about 1 month ago
#66 - Add files via upload
Pull Request -
State: closed - Opened by wj882018 about 1 month ago
#65 - model type
Issue -
State: open - Opened by XiaoMaGe-hero 3 months ago
#64 - Experimental results with MFTCoder are consistently unsatisfactory
Issue -
State: open - Opened by Chaochao2020 5 months ago
- 2 comments
#63 - Update requirements
Pull Request -
State: closed - Opened by chencyudel 5 months ago
#62 - readme
Pull Request -
State: closed - Opened by chencyudel 5 months ago
#61 - bugfix, remove default tensorboard writer to avoid permission issue
Pull Request -
State: closed - Opened by chencyudel 5 months ago
#60 - Bug in the new mftcoder version: Permission denied: '/home/admin'
Issue -
State: closed - Opened by Chaochao2020 5 months ago
- 5 comments
#59 - Evaluating mftcoder with humaneval
Issue -
State: open - Opened by lwh8915 5 months ago
#58 - V0.4.dev
Pull Request -
State: closed - Opened by chencyudel 5 months ago
#57 - RuntimeError: CUDA error: invalid device ordinal
Issue -
State: closed - Opened by lwh8915 5 months ago
- 1 comment
#56 - Update README.md
Pull Request -
State: closed - Opened by twelveand0 6 months ago
#55 - How to handle uneven loss decrease across datasets
Issue -
State: closed - Opened by huangmenglong 6 months ago
- 1 comment
#54 - Update README_cn.md
Pull Request -
State: closed - Opened by twelveand0 7 months ago
#53 - Update README.md
Pull Request -
State: closed - Opened by twelveand0 7 months ago
#52 - convergence curves
Issue -
State: closed - Opened by twelveand0 7 months ago
#51 - Training datasets in the MFTCoder paper
Issue -
State: closed - Opened by superqing001 7 months ago
- 2 comments
#50 - Update README_cn.md to add Join-Us section
Pull Request -
State: closed - Opened by twelveand0 8 months ago
#49 - Update README.md to add Join-Us section
Pull Request -
State: closed - Opened by twelveand0 8 months ago
#48 - Are the task types also generated with gpt?
Issue -
State: closed - Opened by shatealaboxiaowang 8 months ago
- 1 comment
#47 - How can i do continue pretraining?
Issue -
State: open - Opened by hwaking 8 months ago
#46 - Update MFTCoder chat template
Pull Request -
State: closed - Opened by chencyudel 9 months ago
#45 - Error when merging after fine-tuning the model: NotImplementedError: Cannot copy out of meta tensor; no data!
Issue -
State: closed - Opened by xxyp 9 months ago
- 2 comments
#44 - Update README.md
Pull Request -
State: closed - Opened by twelveand0 9 months ago
#43 - What needs to be changed for multi-node training?
Issue -
State: closed - Opened by jy00161yang 10 months ago
- 1 comment
#42 - Error when merging weights after qlora fine-tuning
Issue -
State: closed - Opened by fangzexian 10 months ago
- 4 comments
#41 - Update README.md
Pull Request -
State: closed - Opened by chencyudel 10 months ago
#40 - readme
Pull Request -
State: closed - Opened by chencyudel 10 months ago
#39 - change jpg
Pull Request -
State: closed - Opened by chencyudel 10 months ago
#38 - Update README.md
Pull Request -
State: closed - Opened by jglee2046 10 months ago
#37 - V0.3.0 dev merge
Pull Request -
State: closed - Opened by chencyudel 10 months ago
#36 - Is Wandb or Tensorboard supported?
Issue -
State: closed - Opened by pydaxing 10 months ago
- 1 comment
#35 - What changes are needed to support chatglm3-6b-base?
Issue -
State: closed - Opened by kevindany 10 months ago
- 2 comments
#34 - Can a 4int gptq model be fine-tuned with lora?
Issue -
State: closed - Opened by wengyuan722 11 months ago
- 4 comments
#33 - Is commercial use of the model permitted?
Issue -
State: closed - Opened by zhangyukun230 11 months ago
- 5 comments
#32 - ValueError: Asking to pad but the tokenizer does not have a padding token. Please select a token to use as `pad_token` `(tokenizer.pad_token = tokenizer.eos_token e.g.)` or add a new pad token via `tokenizer.add_special_tokens({'pad_token': '[PAD]'})`.
Issue -
State: closed - Opened by sxsxsx 11 months ago
- 9 comments
#31 - nccl reported an error
Issue -
State: closed - Opened by belle9217 11 months ago
- 3 comments
#30 - no 7B model size?
Issue -
State: closed - Opened by yiyepiaoling0715 11 months ago
- 2 comments
#29 - json.decoder.JSONDecodeError: Expecting value: line 1 column 2 (char 1)
Issue -
State: closed - Opened by sxsxsx 11 months ago
- 5 comments
#28 - Fine-tuning error on a single v1000 GPU
Issue -
State: closed - Opened by sxsxsx 11 months ago
- 2 comments
#27 - Something wrong when run 'bash run_bash.sh'
Issue -
State: closed - Opened by MaoYouSi 11 months ago
- 1 comment
#26 - Data issue: ValueError: data format not supported, please use prompt/answer, or chatML or pretrain text
Issue -
State: closed - Opened by mst272 11 months ago
#25 - Where in the code is 3.5 Multitask Fine-Tuning with Balanced Losses implemented? (only found the implementation of the first loss)
Issue -
State: closed - Opened by YanqiDai 11 months ago
- 2 comments
#24 - NotImplementedError: Cannot copy out of meta tensor; no data!
Issue -
State: closed - Opened by zzb2019053515 11 months ago
- 1 comment
#23 - How to construct the prompt and terminator for codefuse-llamacode
Issue -
State: closed - Opened by wengyuan722 11 months ago
- 29 comments
#22 - In the loss computation: RuntimeError: element 0 of tensors does not require grad and does not have a grad_fn
Issue -
State: closed - Opened by hhy150 11 months ago
- 3 comments
#21 - little bug fix meet
Issue -
State: closed - Opened by elcky 11 months ago
- 2 comments
#20 - safetensors_rust.SafetensorError: Error while deserializing header: HeaderTooLarge
Issue -
State: closed - Opened by zzb2019053515 11 months ago
- 3 comments
#19 - No progress bar during model training
Issue -
State: closed - Opened by liujingqiao 11 months ago
#18 - No performance improvement from fine-tuning on codellama
Issue -
State: closed - Opened by HPRCEST 11 months ago
- 2 comments
#17 - Update README.md
Pull Request -
State: closed - Opened by huybery 12 months ago
#16 - How should the jsonl dataset be designed for multi-task fine-tuning of a model?
Issue -
State: closed - Opened by a793181018 12 months ago
- 5 comments
#15 - Inquiry about weighted_loss_mode
Issue -
State: closed - Opened by tszdanger 12 months ago
- 1 comment
#14 - When will the FSDP training API be open-sourced?
Issue -
State: closed - Opened by peiji1981 12 months ago
- 1 comment
#13 - A feasibility question: when fine-tuning CodeFuse-CodeGeeX2-6B, can the chatglm2 config from the peft approach be used? Many thanks 🙏
Issue -
State: closed - Opened by whyPeanutbutter 12 months ago
- 1 comment
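#12 - readme.txt says the training data is in jsonl format and refers to the xxx.jsonl file in the project. I could not find the referenced jsonl file; could you provide an example? Thanks 🙏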
Issue -
State: closed - Opened by whyPeanutbutter 12 months ago
- 2 comments
#11 - data.helper cannot be loaded?
Issue -
State: closed - Opened by liudonglei 12 months ago
- 4 comments
#10 - about focal loss mentioned in the paper
Issue -
State: closed - Opened by iDonal almost 1 year ago
- 1 comment
#9 - Could you write a complete fine-tuning example?
Issue -
State: closed - Opened by liudonglei about 1 year ago
- 1 comment
#8 - Model merge fails after fine-tuning CodeFuse-CodeLlama-34B with lora + zero3
Issue -
State: closed - Opened by 3m123 about 1 year ago
- 3 comments
#7 - After fine-tuning the codefuse34b model with MFTCoder, the model loses its code-completion ability; looking for a solution
Issue -
State: closed - Opened by yangyubin1 about 1 year ago
- 3 comments
#6 - How was the high-quality python exercise data generated with chatgpt obtained?
Issue -
State: closed - Opened by 18liumin about 1 year ago
- 1 comment
#5 - Update README_cn.md
Pull Request -
State: closed - Opened by wj882018 about 1 year ago
#4 - Update README.md
Pull Request -
State: closed - Opened by wj882018 about 1 year ago
#3 - Low Pass@1 on the HumanEval test
Issue -
State: closed - Opened by wangzhao88 about 1 year ago
- 2 comments
#2 - Download options within China
Issue -
State: closed - Opened by wuyihz about 1 year ago
- 1 comment
#1 - Does the training data include Chinese data?
Issue -
State: closed - Opened by smashfan about 1 year ago
- 1 comment