Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / boyu-ai/Hands-on-RL issues and pull requests
#38 - Action Space Limitations in Continuous PPO Algorithm in Chapter 12
Issue -
State: open - Opened by ASUKaiwenFang over 1 year ago
- 1 comment
#37 - Formula error in 3.5
Issue -
State: open - Opened by StevenJokess over 1 year ago
- 1 comment
#36 - Formula error in 14.3
Issue -
State: open - Opened by StevenJokess over 1 year ago
#35 - Unclear meaning of the formula in Section 1.3, The Environment in Reinforcement Learning
Issue -
State: open - Opened by qixitan over 1 year ago
#34 - Formula error in 2.5
Issue -
State: open - Opened by StevenJokess over 1 year ago
- 1 comment
#33 - Regarding an example that uses multiDiscrete action spaces
Issue -
State: open - Opened by jianzuo over 1 year ago
#32 - Formula error in 2.4
Issue -
State: open - Opened by StevenJokess over 1 year ago
#31 - ValueError: expected sequence of length 3 at dim 2 (got 0)
Issue -
State: open - Opened by Yang1231 over 1 year ago
- 6 comments
#30 - Chapter 7, DQN Algorithm: training raises ValueError: expected sequence of length 4 at dim 2 (got 0)
Issue -
State: closed - Opened by horacehht over 1 year ago
- 8 comments
#29 - Chapter 10, Actor-Critic Algorithm: code practice
Issue -
State: closed - Opened by zlh-seuer over 1 year ago
- 2 comments
#28 - Chapter 10, Actor-Critic Algorithm: minor syntax issue
Issue -
State: closed - Opened by lbgitjp almost 2 years ago
- 1 comment
#27 - Chapter 7, DQN Algorithm: minor syntax issue
Issue -
State: closed - Opened by lbgitjp almost 2 years ago
#26 - 第4章-动态规划算法.ipynb (Chapter 4, Dynamic Programming Algorithms): unnecessary loop code
Issue -
State: closed - Opened by lbgitjp almost 2 years ago
#25 - 第4章-动态规划算法.ipynb (Chapter 4, Dynamic Programming Algorithms): the value of p in the transition matrix is always 1
Issue -
State: closed - Opened by lbgitjp almost 2 years ago
#24 - Chapter 20: IPPO training speed
Issue -
State: open - Opened by Chuan-shanjia almost 2 years ago
- 1 comment
#23 - 第7章-DQN算法.ipynb (Chapter 7, DQN Algorithm): could a complete image-based CNN DQN code example be provided for study?
Issue -
State: open - Opened by haoshuiwuxiang almost 2 years ago
- 1 comment
#22 - some questions for rl_utils.py
Issue -
State: closed - Opened by Asuna233 almost 2 years ago
#21 - Could a Docker image containing the environment for running the code be packaged?
Issue -
State: open - Opened by haoshuiwuxiang almost 2 years ago
- 1 comment
#20 - Erratum: Reinforcement Learning Basics, Multi-Armed Bandit, ϵ-greedy algorithm: DecayingEpsilonGreedy code indentation/formatting issue
Issue -
State: closed - Opened by earlytobed almost 2 years ago
- 1 comment
#19 - Explanation of the meaning of the dones parameter in the update function of the Chapter 7 DQN algorithm
Issue -
State: closed - Opened by Yandong1223 almost 2 years ago
- 2 comments
#18 - Could some comments be added to the code?
Issue -
State: closed - Opened by cloudmisst almost 2 years ago
- 1 comment
#17 - a little piece of advice and modification in chapter 3.3.2 Value Function
Issue -
State: closed - Opened by erjiaxiao almost 2 years ago
- 1 comment
#16 - Chapter 7 code raises an error
Issue -
State: closed - Opened by hongsheng2000 almost 2 years ago
- 1 comment
#15 - Will the code for the other algorithms also be explained?
Issue -
State: closed - Opened by 670555467 almost 2 years ago
- 1 comment
#14 - Could not find a feedback channel on the website, so giving feedback here.
Issue -
State: closed - Opened by leshui1991 about 2 years ago
- 1 comment
#13 - Question about the sampling function code in "Chapter 3: Markov Decision Processes"
Issue -
State: closed - Opened by kj-wu about 2 years ago
- 4 comments
#11 - Question: computing the Q(s,a) value with the dynamic programming algorithm
Issue -
State: closed - Opened by goodzhangbobo about 2 years ago
- 6 comments
#7 - some question about codes of REINFORCE
Issue -
State: closed - Opened by zhiyiZeng over 2 years ago
- 2 comments
#5 - My eyesight is not great; does buying the book come with a PDF version?
Issue -
State: closed - Opened by 18600130137 over 2 years ago
- 1 comment
#4 - Where can the "rl_utils" script be downloaded?
Issue -
State: closed - Opened by ihewro almost 3 years ago
- 1 comment
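#3 - Is "The Dyna-Q algorithm introduced in this lesson is also a very basic model-based reinforcement learning method; its environment model is obtained by estimation." mistaken?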
Issue -
State: closed - Opened by ihewro almost 3 years ago
- 1 comment
#2 - Is "Every 10 sequences, print the average return of those 10 sequences" mistaken?
Issue -
State: closed - Opened by ihewro almost 3 years ago
- 1 comment