Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / seungeunrho/minimalRL issues and pull requests

#63 - Wrong formula for calc-target in SAC?

Issue - State: open - Opened by BeFranke about 2 months ago

#62 - Training speed is very slow!!!

Issue - State: open - Opened by xuzhou666 8 months ago - 1 comment

#61 - Update for the latest library enviromnet

Pull Request - State: closed - Opened by Lukious over 1 year ago - 1 comment

#60 - fixes syntax

Pull Request - State: open - Opened by lazybuttrying over 1 year ago - 1 comment

#59 - TypeError: expected np.ndarray (got tuple)

Issue - State: open - Opened by InguChoi almost 2 years ago - 1 comment

#58 - Remove redundant while loop break in dqn.py

Pull Request - State: open - Opened by ginoperrotta almost 3 years ago

#57 - DQN why train iterate for 10 times

Issue - State: open - Opened by FeynmanDNA almost 3 years ago

#56 - MuZero minimal implementation

Issue - State: open - Opened by ipsec about 3 years ago

#55 - Implemented r2d2

Pull Request - State: open - Opened by jsrimr about 3 years ago

#54 - implemented ape-x

Pull Request - State: open - Opened by jsrimr about 3 years ago

#53 - Update dqn.py

Pull Request - State: closed - Opened by jsrimr about 3 years ago

#52 - Minimal way to save / replay trained model?

Issue - State: open - Opened by HanClinto about 3 years ago

#51 - Add minimal IMPALA?

Issue - State: closed - Opened by meadewaking over 3 years ago - 2 comments

#50 - Query about LSTM

Issue - State: open - Opened by npitsillos over 3 years ago

#49 - Add meta RL algorithms?

Issue - State: open - Opened by ghost over 3 years ago

#48 - Cartpole environment with Multidiscrete action space

Issue - State: open - Opened by mg64ve over 3 years ago

#47 - Cartpole environment with Multidiscrete action space

Issue - State: closed - Opened by mgazzin over 3 years ago - 3 comments

#46 - A naive question about updating parameters in DDPG.

Issue - State: open - Opened by HiddenBeginner over 3 years ago

#45 - ppo minibatch version

Pull Request - State: open - Opened by seolhokim over 3 years ago

#44 - improve continuous-ppo

Pull Request - State: open - Opened by seolhokim over 3 years ago

#43 - Remove unused import

Issue - State: closed - Opened by neal2018 over 3 years ago - 1 comment

#42 - cartpole ppo train , reward drop

Issue - State: open - Opened by SeungyounShin over 3 years ago - 1 comment

#41 - Detach the target

Pull Request - State: closed - Opened by jsrimr almost 4 years ago - 1 comment

#40 - Maybe a bug in SAC Implementation?

Issue - State: closed - Opened by arthur-x almost 4 years ago - 1 comment

#38 - Soft Actor Critic?

Issue - State: closed - Opened by EmmanuelMess almost 4 years ago - 1 comment

#37 - TD3: Twin Delayed DDPG

Issue - State: open - Opened by zcaicaros almost 4 years ago - 2 comments

#36 - PPO update mistake?

Issue - State: closed - Opened by zcaicaros almost 4 years ago - 1 comment

#35 - fix ddpg data type error

Pull Request - State: closed - Opened by seolhokim almost 4 years ago - 1 comment

#34 - RuntimeError while running DDPG.py

Issue - State: closed - Opened by rl-max almost 4 years ago - 2 comments

#33 - The ratio in ppo.py should be detach() ?

Issue - State: closed - Opened by dedekinds about 4 years ago - 5 comments

#32 - Missing done mask?

Issue - State: closed - Opened by Junyoungpark about 4 years ago - 3 comments

#31 - torch.gather in relevant to policy gradient

Issue - State: open - Opened by migom6 over 4 years ago

#30 - PPO has no entropy factor

Issue - State: open - Opened by CesMak over 4 years ago

#29 - Questions about A3C

Issue - State: closed - Opened by ghost over 4 years ago - 1 comment

#28 - Termination of a CartPole episode in REINFORCE.py

Issue - State: closed - Opened by ansari1375 over 4 years ago - 1 comment

#27 - fix : #26 the bug that was updated per step was fixed

Pull Request - State: closed - Opened by seolhokim almost 5 years ago

#26 - Problem of `train_net()` in REINFORCE algorithm.

Issue - State: closed - Opened by fuyw almost 5 years ago - 5 comments

#26 - Problem of `train_net()` in REINFORCE algorithm.

Issue - State: closed - Opened by fuyw almost 5 years ago - 5 comments

#25 - Sac test

Pull Request - State: closed - Opened by seolhokim almost 5 years ago - 1 comment

#24 - td3 test

Pull Request - State: closed - Opened by seolhokim almost 5 years ago - 1 comment

#23 - continuous-ppo

Pull Request - State: closed - Opened by seolhokim almost 5 years ago - 3 comments

#22 - ddpg unnecessary codes has been deleted

Pull Request - State: closed - Opened by seolhokim almost 5 years ago

#21 - ddpg minor bug

Pull Request - State: closed - Opened by seolhokim almost 5 years ago - 1 comment

#20 - continuous ppo test version

Pull Request - State: closed - Opened by seolhokim almost 5 years ago - 1 comment

#19 - Add SAC?

Issue - State: closed - Opened by banma12956 almost 5 years ago

#18 - TF2 implementation for Policy Gradient Reinforce

Issue - State: closed - Opened by dragen1860 about 5 years ago

#17 - LSTM + PPO value fitting

Issue - State: closed - Opened by hnshahao about 5 years ago - 1 comment

#16 - Fix indexing error in retrace operation of ACER

Pull Request - State: closed - Opened by wwiiiii about 5 years ago - 1 comment

#15 - Wrong gradient flow in bias correction term of ACER?

Issue - State: closed - Opened by wwiiiii about 5 years ago - 1 comment

#14 - docs(README): add links to source code; format

Pull Request - State: closed - Opened by jjangga0214 about 5 years ago - 1 comment

#13 - Add A2C

Pull Request - State: closed - Opened by rahulptel about 5 years ago

#12 - PPO Continuous Action Space

Issue - State: closed - Opened by raunakdoesdev about 5 years ago - 2 comments

#11 - Add new algorithms

Issue - State: open - Opened by rahulptel about 5 years ago - 7 comments

#10 - Fix improper asynchronous updates in A3C

Pull Request - State: closed - Opened by rahulptel about 5 years ago

#9 - Improper asynchronous update in a3c

Issue - State: closed - Opened by rahulptel about 5 years ago - 1 comment

#8 - Wrong td_target and test() call in a3c implementation

Issue - State: closed - Opened by rahulptel about 5 years ago - 1 comment

#7 - Typo of actor_critic.py?

Issue - State: closed - Opened by seungwonpark about 5 years ago - 1 comment

#6 - Please add 1 continuous env

Issue - State: closed - Opened by bionicles over 5 years ago - 2 comments

#5 - Update ppo.py

Pull Request - State: closed - Opened by jsrimr over 5 years ago - 1 comment

#4 - Remove ReplayBuffer class

Pull Request - State: closed - Opened by jwergieluk over 5 years ago - 2 comments

#3 - Use maxlen in deque initializer

Issue - State: closed - Opened by jwergieluk over 5 years ago - 1 comment

#2 - train() overwrites the base method of nn.Module

Issue - State: closed - Opened by NikEyX over 5 years ago - 1 comment

#1 - Reinforce implementation looks to use old data without importance sampling

Issue - State: closed - Opened by sritee over 5 years ago - 1 comment