Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / boyu-ai/Hands-on-RL issues and pull requests

#87 - 请教PPO问题

Issue - State: open - Opened by 394262597 28 days ago

#87 - 请教PPO问题

Issue - State: open - Opened by 394262597 28 days ago

#86 - Chapter 7

Issue - State: open - Opened by A1513906286 about 1 month ago - 1 comment

#86 - Chapter 7

Issue - State: open - Opened by A1513906286 about 1 month ago - 1 comment

#85 - Chapter 7

Issue - State: open - Opened by A1513906286 about 1 month ago

#85 - Chapter 7

Issue - State: open - Opened by A1513906286 about 1 month ago

#84 - 多臂老虎机ε - 贪心算法 解释部分有问题

Issue - State: open - Opened by gymdarius about 1 month ago

#84 - 多臂老虎机ε - 贪心算法 解释部分有问题

Issue - State: open - Opened by gymdarius about 1 month ago

#83 - trpo

Issue - State: open - Opened by L-lorish about 1 month ago

#83 - trpo

Issue - State: open - Opened by L-lorish about 1 month ago

#82 - 策略梯度证明笔误?

Issue - State: open - Opened by lanceyliao about 2 months ago - 2 comments

#82 - 策略梯度证明笔误?

Issue - State: open - Opened by lanceyliao about 2 months ago - 2 comments

#81 - 第10章Actor-Critic中actor_loss为何加torch.mean?

Issue - State: closed - Opened by lanceyliao about 2 months ago

#81 - 第10章Actor-Critic中actor_loss为何加torch.mean?

Issue - State: closed - Opened by lanceyliao about 2 months ago

#80 - 3.6. 占用度量,为何逆序计算?

Issue - State: closed - Opened by lanceyliao 2 months ago - 1 comment

#80 - 3.6. 占用度量,为何逆序计算?

Issue - State: closed - Opened by lanceyliao 2 months ago - 1 comment

#79 - 第九章策略梯度的损失函数

Issue - State: open - Opened by mgt-lya 3 months ago - 1 comment

#79 - 第九章策略梯度的损失函数

Issue - State: open - Opened by mgt-lya 3 months ago - 1 comment

#78 - https://www.boyuai.com/进不去了

Issue - State: open - Opened by virtualxiaoman 4 months ago - 1 comment

#78 - https://www.boyuai.com/进不去了

Issue - State: open - Opened by virtualxiaoman 4 months ago - 1 comment

#77 - 马尔可夫决策过程,MDP转化为MRP时计算的P疑似有误

Issue - State: open - Opened by zyy777 5 months ago - 1 comment

#77 - 马尔可夫决策过程,MDP转化为MRP时计算的P疑似有误

Issue - State: open - Opened by zyy777 5 months ago - 1 comment

#76 - 关于web教程布局的建议

Issue - State: open - Opened by dctwan15 5 months ago

#76 - 关于web教程布局的建议

Issue - State: open - Opened by dctwan15 5 months ago

#73 - 关于环境初始化的一点提示

Issue - State: open - Opened by Summer907 6 months ago

#73 - 关于环境初始化的一点提示

Issue - State: open - Opened by Summer907 6 months ago

#72 - CartPole-v0环境训练reward超过上限值200?

Issue - State: closed - Opened by SHTechBoBo 6 months ago - 1 comment

#72 - CartPole-v0环境训练reward超过上限值200?

Issue - State: closed - Opened by SHTechBoBo 6 months ago - 1 comment

#70 - PPO在单摆实验中为什么要对reward=(reward+8)/8的修改呢?

Issue - State: closed - Opened by xxoospring 9 months ago - 2 comments

#70 - PPO在单摆实验中为什么要对reward=(reward+8)/8的修改呢?

Issue - State: closed - Opened by xxoospring 9 months ago - 2 comments

#69 - SAC伪代码存在一点小问题

Issue - State: open - Opened by taojunhui 9 months ago

#69 - SAC伪代码存在一点小问题

Issue - State: open - Opened by taojunhui 9 months ago

#68 - DQN ReplayBuffer

Issue - State: open - Opened by xxoospring 10 months ago - 1 comment

#68 - DQN ReplayBuffer

Issue - State: open - Opened by xxoospring 10 months ago - 1 comment

#67 - 用spyder跑PPO代码,kernel自动关闭了

Issue - State: closed - Opened by Shawkncok 10 months ago - 1 comment

#67 - 用spyder跑PPO代码,kernel自动关闭了

Issue - State: closed - Opened by Shawkncok 10 months ago - 1 comment

#64 - 7.4 DQN 算法反向传播有没有进行求导??

Issue - State: open - Opened by anranyicheng 11 months ago - 1 comment

#64 - 7.4 DQN 算法反向传播有没有进行求导??

Issue - State: open - Opened by anranyicheng 11 months ago - 1 comment

#63 - SAC算法——状态价值函数存在问题

Issue - State: open - Opened by Dilettante258 11 months ago

#63 - SAC算法——状态价值函数存在问题

Issue - State: open - Opened by Dilettante258 11 months ago

#62 - 运行环境

Issue - State: open - Opened by zheng-lv 11 months ago - 1 comment

#62 - 运行环境

Issue - State: open - Opened by zheng-lv 11 months ago - 1 comment

#61 - 21章MADDPG代码问题,存在维度不匹配

Issue - State: open - Opened by CorneliusDeng 11 months ago - 2 comments

#61 - 21章MADDPG代码问题,存在维度不匹配

Issue - State: open - Opened by CorneliusDeng 11 months ago - 2 comments

#60 - 20章的代码问题

Issue - State: open - Opened by Wayne857 11 months ago - 3 comments

#60 - 20章的代码问题

Issue - State: open - Opened by Wayne857 11 months ago - 3 comments

#59 - 第七章DNQ回报超出200

Issue - State: closed - Opened by KingOfChuXuan 12 months ago - 1 comment

#59 - 第七章DNQ回报超出200

Issue - State: closed - Opened by KingOfChuXuan 12 months ago - 1 comment

#58 - 已解决

Issue - State: closed - Opened by Thovenfish 12 months ago

#58 - 已解决

Issue - State: closed - Opened by Thovenfish 12 months ago

#56 - MARL的PPT的第7页和8页参考文献咋相同?

Issue - State: open - Opened by StevenJokess about 1 year ago - 1 comment

#56 - MARL的PPT的第7页和8页参考文献咋相同?

Issue - State: open - Opened by StevenJokess about 1 year ago - 1 comment

#54 - 第八章 `In [7]`代码块,VAnet() 疑似有误

Issue - State: open - Opened by Aegis1863 about 1 year ago - 1 comment

#54 - 第八章 `In [7]`代码块,VAnet() 疑似有误

Issue - State: open - Opened by Aegis1863 about 1 year ago - 1 comment

#51 - 关于开发环境配置

Issue - State: open - Opened by mellody11 about 1 year ago - 4 comments

#51 - 关于开发环境配置

Issue - State: open - Opened by mellody11 about 1 year ago - 4 comments

#50 - 第七章DQN代运行报错

Issue - State: open - Opened by ShuoZheLi about 1 year ago - 3 comments

#50 - 第七章DQN代运行报错

Issue - State: open - Opened by ShuoZheLi about 1 year ago - 3 comments

#49 - 制作了 EPUB 格式

Issue - State: open - Opened by wizardforcel about 1 year ago

#49 - 制作了 EPUB 格式

Issue - State: open - Opened by wizardforcel about 1 year ago

#47 - 蒙特卡罗采样动作和状态 temp变量为什么是累加呢

Issue - State: open - Opened by ChengchengDu over 1 year ago - 1 comment

#47 - 蒙特卡罗采样动作和状态 temp变量为什么是累加呢

Issue - State: open - Opened by ChengchengDu over 1 year ago - 1 comment

#46 - DDPG算法篇笔误

Issue - State: closed - Opened by Neuerliu over 1 year ago - 1 comment

#46 - DDPG算法篇笔误

Issue - State: closed - Opened by Neuerliu over 1 year ago - 1 comment

#45 - 第18章cql代码

Issue - State: open - Opened by Jaceyxy over 1 year ago

#45 - 第18章cql代码

Issue - State: open - Opened by Jaceyxy over 1 year ago

#43 - 第十六章 模型预测控制 EnsembleModel类:train方法的问题

Issue - State: open - Opened by Yandong23 over 1 year ago - 1 comment

#43 - 第十六章 模型预测控制 EnsembleModel类:train方法的问题

Issue - State: open - Opened by Yandong23 over 1 year ago - 1 comment

#42 - 第20章 未定义win?

Issue - State: closed - Opened by beyondliaaaa over 1 year ago

#42 - 第20章 未定义win?

Issue - State: closed - Opened by beyondliaaaa over 1 year ago

#41 - 3.5公式不准确

Issue - State: closed - Opened by administrator418 over 1 year ago

#41 - 3.5公式不准确

Issue - State: closed - Opened by administrator418 over 1 year ago

#40 - Dueling DQN部分的疑问

Issue - State: open - Opened by Ruanzhh over 1 year ago - 2 comments

#40 - Dueling DQN部分的疑问

Issue - State: open - Opened by Ruanzhh over 1 year ago - 2 comments

#39 - 网页版本与纸质书的区别?

Issue - State: open - Opened by sibangde over 1 year ago

#39 - 网页版本与纸质书的区别?

Issue - State: open - Opened by sibangde over 1 year ago