Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / ssbuild/qwen_finetuning issues and pull requests
#29 - Hello, a question: is there a way to convert the last.ckpt model produced by fine-tuning with bash train_full.sh -m train into safetensors files like those under https://huggingface.co/Qwen/Qwen-1_8B-Chat/tree/main?
Issue -
State: open - Opened by dgo2dance about 1 year ago
- 1 comment
#28 - Hello, how should the training parameters be configured, and what command starts training?
Issue -
State: open - Opened by dgo2dance about 1 year ago
- 3 comments
#27 - Why is the system prompt fixed to "You are a helpful assistant."?
Issue -
State: closed - Opened by tangsipeng about 1 year ago
- 2 comments
#26 - fix script
Pull Request -
State: closed - Opened by ssbuild over 1 year ago
#25 - fix script
Pull Request -
State: closed - Opened by ssbuild over 1 year ago
#24 - auto precision
Pull Request -
State: closed - Opened by ssbuild over 1 year ago
#22 - "gradient_checkpointing": False
Pull Request -
State: closed - Opened by ssbuild over 1 year ago
#21 - support accelerator
Pull Request -
State: closed - Opened by ssbuild over 1 year ago
#20 - v0.2.5
Pull Request -
State: closed - Opened by ssbuild over 1 year ago
#19 - v0.2.5
Pull Request -
State: closed - Opened by ssbuild over 1 year ago
#18 - support ia3
Pull Request -
State: closed - Opened by ssbuild over 1 year ago
#17 - Question about <|endoftext|>
Issue -
State: closed - Opened by quzx over 1 year ago
- 6 comments
#16 - attention_mask has a bug
Issue -
State: closed - Opened by quzx over 1 year ago
- 7 comments
#15 - support ia3
Pull Request -
State: closed - Opened by ssbuild over 1 year ago
#14 - 0.2.4
Pull Request -
State: closed - Opened by ssbuild over 1 year ago
#12 - deepspeed precision
Pull Request -
State: closed - Opened by ssbuild over 1 year ago
#11 - update
Pull Request -
State: closed - Opened by ssbuild over 1 year ago
#10 - 0.2.0
Pull Request -
State: closed - Opened by ssbuild over 1 year ago
#9 - Problem encountered during deserialization
Issue -
State: closed - Opened by wellcasa over 1 year ago
- 2 comments
#8 - NN_DataHelper
Issue -
State: closed - Opened by wellcasa over 1 year ago
- 4 comments
#7 - Error raised inside the deep_training package. Quite strange.
Issue -
State: closed - Opened by wellcasa over 1 year ago
- 15 comments
#6 - Getting an error; it looks like eos_token is not set
Issue -
State: closed - Opened by wellcasa over 1 year ago
- 2 comments
#5 - After re-pulling all the latest Qwen-chat models and configs, along with this project's latest code, LoRA fine-tuning no longer works
Issue -
State: closed - Opened by troycjj over 1 year ago
- 3 comments
Labels: bug
#4 - LoRA inference error
Issue -
State: closed - Opened by troycjj over 1 year ago
- 1 comment
#3 - Error when using the int4 quantized model with transformers
Issue -
State: closed - Opened by yxk9810 over 1 year ago
- 2 comments
#2 - When batchsize>1, the error "shape '[batchsize, -1]' is invalid for input of attention_mask size" is shown
Issue -
State: closed - Opened by xxll88 over 1 year ago
- 1 comment
Labels: bug
#1 - infer.py inference exception
Issue -
State: closed - Opened by evanweiguohua over 1 year ago
- 1 comment
Labels: bug