Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / lucidrains/self-rewarding-lm-pytorch issues and pull requests

#31 - What's the reference model for DPO?

Issue - State: closed - Opened by Draconda 5 months ago - 1 comment

#29 - Fixed deep copy, shallow copy error and label mask error.

Pull Request - State: closed - Opened by Control-derek 6 months ago - 1 comment

#28 - Solves the problem that some variables are not declared

Pull Request - State: closed - Opened by Control-derek 6 months ago - 1 comment

#27 - Solves the problem that some variables are not declared

Pull Request - State: closed - Opened by Control-derek 6 months ago - 1 comment

#26 - add self.

Pull Request - State: closed - Opened by Control-derek 6 months ago - 1 comment

#25 - ModuleNotFoundError: No module named 'x_transformers'

Issue - State: open - Opened by mayankpathaklumiq 7 months ago - 1 comment

#21 - I encountered the following error when trying to run usage

Issue - State: open - Opened by Yanfors 7 months ago - 1 comment

#19 - Fix TypeError for is_valid_reward in SelfRewardDPOConfig

Pull Request - State: closed - Opened by ViswanathaReddyGajjala 7 months ago - 1 comment

#18 - TypeError: tuple indices must be integers or slices, not tuple

Issue - State: open - Opened by fakerybakery 7 months ago - 1 comment

#17 - Update self_rewarding_lm_pytorch.py

Pull Request - State: closed - Opened by unaidedelf8777 8 months ago - 1 comment

#15 - RuntimeError: Placeholder storage has not been allocated on MPS device!

Issue - State: closed - Opened by fakerybakery 8 months ago - 2 comments

#14 - Multiple GPUs

Issue - State: closed - Opened by fakerybakery 8 months ago

#13 - Update self_rewarding_lm_pytorch.py

Pull Request - State: closed - Opened by Dyke-F 8 months ago - 1 comment

#12 - Update spin.py

Pull Request - State: closed - Opened by Dyke-F 8 months ago - 2 comments

#10 - How to use HF Transformers model

Issue - State: open - Opened by fakerybakery 8 months ago - 3 comments

#9 - Default `iteration` about SPIN. (Reward model~Policy model)

Issue - State: closed - Opened by KyujinHan 8 months ago - 1 comment

#8 - run spin demo

Issue - State: closed - Opened by westlongtime 8 months ago - 3 comments

#7 - The reward prompt is weak.

Issue - State: closed - Opened by Minami-su 8 months ago - 6 comments

#5 - Update README.md

Pull Request - State: closed - Opened by eltociear 8 months ago - 1 comment

#4 - Is this work in progress?

Issue - State: closed - Opened by jbdatascience 8 months ago - 4 comments

#3 - Help with Setting up and running ?

Issue - State: closed - Opened by badboysm890 8 months ago - 1 comment

#1 - code and dataset?

Issue - State: closed - Opened by wanghao-007 8 months ago