Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / l294265421/alpaca-rlhf issues and pull requests
#16 - 增大max_prompt_len和max_ans_len训练会出现非法的内存访问问题
Issue -
State: open - Opened by Luoxiaohei41 12 months ago
#15 - 训练问题
Issue -
State: open - Opened by wanghao-007 about 1 year ago
#14 - Step 3: Actor model和Reward model使用不同的tokenizer
Issue -
State: open - Opened by Kevin-myxu over 1 year ago
#13 - step2和step3中padding side似乎不一样?
Issue -
State: open - Opened by qiancheng99 over 1 year ago
- 1 comment
#12 - A question about setting tokens
Issue -
State: open - Opened by hepj987 over 1 year ago
- 1 comment
#11 - element 0 of tensors does not require grad and does not have a grad_fn
Issue -
State: open - Opened by Bill-Orz over 1 year ago
- 5 comments
#10 - Fix pad_token_id bug
Issue -
State: closed - Opened by Ablustrund over 1 year ago
- 2 comments
#9 - 关于Step3中是否需要把生成的answer中eos后面token mask掉
Issue -
State: closed - Opened by Ablustrund over 1 year ago
- 1 comment
#8 - deepspeed.initialize的一些疑惑
Issue -
State: closed - Opened by iamsile over 1 year ago
- 8 comments
#7 - how to run it, need more details
Issue -
State: open - Opened by SeekPoint over 1 year ago
- 2 comments
#6 - v100 step3 oom
Issue -
State: closed - Opened by iamsile over 1 year ago
- 12 comments
#5 - stop at step2 evaluation_reward
Issue -
State: open - Opened by murphypei over 1 year ago
- 4 comments
#4 - reward model在v100上训练时会卡住不动
Issue -
State: closed - Opened by iamsile over 1 year ago
- 2 comments
#3 - v100训练时显存oom
Issue -
State: closed - Opened by iamsile over 1 year ago
- 2 comments
#2 - Steps
Issue -
State: open - Opened by syngokhan over 1 year ago
- 1 comment
#1 - 训练效果怎么样
Issue -
State: closed - Opened by Curious-chen over 1 year ago
- 3 comments