werner-duvaud/muzero-general issues and pull requests

#233 - Api integration & won games statistic

Pull Request - State: closed - Opened by slimyjimmy 15 days ago

#232 - Bump actions/download-artifact from 2 to 4.1.7 in /.github/workflows

Pull Request - State: open - Opened by dependabot[bot] 6 months ago
Labels: dependencies

#231 - Chess and other non-trivial games

Issue - State: open - Opened by StepHaze about 1 year ago - 2 comments
Labels: enhancement

#230 - Update game logic and error handling in spiel.py

Pull Request - State: open - Opened by leoni-q about 1 year ago

#229 - cross play between models

Issue - State: open - Opened by dmtrung14 about 1 year ago
Labels: enhancement

#228 - What does the `replay_buffer.pkl` do?

Issue - State: closed - Opened by dmtrung14 about 1 year ago - 1 comment

#227 - How can I use a pre-trained model?

Issue - State: open - Opened by worldsoft over 1 year ago - 1 comment
Labels: enhancement

#226 - MuZero choose the same action

Issue - State: open - Opened by sdumi03 over 1 year ago
Labels: bug

#225 - Muzero crashes when choosing spiel/backgammon

Issue - State: open - Opened by artshar over 1 year ago
Labels: bug

#224 - Fix: gym v26 migration

Pull Request - State: open - Opened by lduchosal over 1 year ago

#223 - Breakout: ModuleNotFoundError shows gym[atari] instead of python-opencv

Issue - State: open - Opened by velicanerdem over 1 year ago - 1 comment
Labels: bug

#222 - question about action encoded

Issue - State: open - Opened by Nightbringers almost 2 years ago
Labels: bug

#221 - The difference between official pseudo code and this repository about "num_unroll_steps"

Issue - State: open - Opened by ZF4444 almost 2 years ago
Labels: bug

#220 - Dirichlet noise added outside of training

Issue - State: open - Opened by TommyX12 almost 2 years ago
Labels: bug

#219 - Update gym package to gymnasium

Issue - State: open - Opened by Mlokos almost 2 years ago - 1 comment
Labels: enhancement

#218 - fix: fixed ray's error 'No module named aiohttp.signals'

Pull Request - State: open - Opened by ChunchangShao about 2 years ago

#217 - Without Selfplay, why 1 games on a running??

Issue - State: open - Opened by dlrlfkr11 about 2 years ago - 2 comments
Labels: enhancement

#216 - [WIP] Sampled Muzero

Pull Request - State: open - Opened by JosephDenman about 2 years ago - 1 comment

#215 - Mean_value plot in Total_reward - Interpretation

Issue - State: open - Opened by SunilaAkbar about 2 years ago

#214 - Question about the dimension of value and reward network

Issue - State: open - Opened by jiachengc about 2 years ago
Labels: bug

#213 - Can't train using GPU? The torch version for this environment is '1.10.0cpu', that is, CPU one.

Issue - State: open - Opened by SunilaAkbar over 2 years ago - 2 comments
Labels: enhancement

#212 - Question about the perspective transformation of two players when calculating Q?

Issue - State: open - Opened by puyuan1996 over 2 years ago

#211 - The model does not converge for breakout

Issue - State: open - Opened by yungangwu over 2 years ago - 13 comments
Labels: enhancement

#210 - Only One Player: Can we use MuZero?

Issue - State: open - Opened by 1121091694 over 2 years ago - 2 comments
Labels: enhancement

#209 - Switch architecture to parallel sync ray tasks

Pull Request - State: closed - Opened by cmarlin over 2 years ago

#208 - TypeError: can't pickle function objects

Issue - State: open - Opened by OopsYouDiedE over 2 years ago
Labels: bug

#207 - Question: Does muzero-general support 2 player games with simultaneous action selection?

Issue - State: open - Opened by moscoso over 2 years ago - 3 comments

#206 - Uncertainty pls

Pull Request - State: closed - Opened by Dirichi over 2 years ago - 1 comment

#205 - Small ensemble

Pull Request - State: closed - Opened by Dirichi over 2 years ago - 1 comment

#204 - Consistency

Pull Request - State: closed - Opened by Dirichi over 2 years ago

#203 - Remove Batch Norm?

Issue - State: open - Opened by verbose-void over 2 years ago
Labels: enhancement

#202 - OpenGL rendering on a remote server over X11

Issue - State: open - Opened by jrjbertram over 2 years ago
Labels: bug

#201 - [Question] Is the environment required to have no hidden information?

Issue - State: closed - Opened by ZhengWenZhang over 2 years ago

#200 - sampling in continuous/complex action spaces with 'density prior' is not working

Issue - State: open - Opened by ManorZ over 2 years ago
Labels: bug

#199 - Batch MCTS

Issue - State: open - Opened by szrlee over 2 years ago
Labels: enhancement

#198 - raw install has ray problem

Issue - State: open - Opened by EngrStudent over 2 years ago - 1 comment
Labels: bug

#197 - Render model

Issue - State: open - Opened by theeduardomora over 2 years ago - 1 comment
Labels: bug

#196 - Is there a place we can see and share results for each game?

Issue - State: open - Opened by onegigbyte over 2 years ago

#195 - custom observation transformation

Issue - State: open - Opened by SimpleMathmatics over 2 years ago

#194 - Fix requirements in ci

Pull Request - State: closed - Opened by werner-duvaud over 2 years ago
Labels: bug

#193 - Policy target after MCTS should be in form of probabilities

Issue - State: open - Opened by 2M-kotb over 2 years ago - 1 comment

#192 - my first commit

Pull Request - State: closed - Opened by aliigii almost 3 years ago

#191 - Sampled MuZero implementation

Issue - State: open - Opened by matthiaskiller almost 3 years ago - 1 comment
Labels: enhancement

#190 - fix a bug due to the use of DataParallel

Pull Request - State: closed - Opened by vincentzhang almost 3 years ago

#189 - Can muzero learn to play two different games at the same time

Issue - State: open - Opened by lwaif almost 3 years ago - 1 comment
Labels: enhancement

#188 - Add workflow

Pull Request - State: closed - Opened by ahainaut almost 3 years ago

#187 - Add github workflow

Pull Request - State: closed - Opened by ahainaut almost 3 years ago

#186 - Strange observations and actions in continuous implementation.

Issue - State: closed - Opened by dylanamiller almost 3 years ago - 1 comment

#185 - MuZero Unplugged

Issue - State: open - Opened by tbskrpmnns almost 3 years ago - 7 comments
Labels: enhancement, question

#184 - Why is root.visit_count initialized to 0 and root_predicted_value not included in root node value?

Issue - State: open - Opened by dniku almost 3 years ago
Labels: enhancement

#183 - procgen

Issue - State: open - Opened by hlsfin about 3 years ago
Labels: enhancement, question

#182 - ray.exceptions.RayActorError: The actor died unexpectedly before finishing this task.

Issue - State: open - Opened by hairinwind about 3 years ago - 1 comment

#181 - Why my game does not remember the steps trained

Issue - State: closed - Opened by hairinwind about 3 years ago - 3 comments

#180 - training result cannot be loaded on another machine

Issue - State: open - Opened by hairinwind about 3 years ago - 1 comment

#178 - Scaling of historical stacked observations

Issue - State: closed - Opened by tuero about 3 years ago - 1 comment

#177 - added support for multiple dimension continuous action spaces

Pull Request - State: open - Opened by devin-m-NRL over 3 years ago - 1 comment

#176 - Total Training Reward rises then drops again

Issue - State: closed - Opened by annahambi over 3 years ago - 2 comments
Labels: question

#175 - Entropy loss in continuous actions

Issue - State: closed - Opened by 2M-kotb over 3 years ago - 4 comments

#174 - RuntimeError: module must have its parameters and buffers on device cuda:0 (device_ids[0]) but found one of them on device: cpu

Issue - State: open - Opened by lukaszkn over 3 years ago - 3 comments

#173 - Is there a more optimized way to complete training? Every game has to be learned for so long, people learn to master it early

Issue - State: closed - Opened by lwaif over 3 years ago - 1 comment
Labels: question

#172 - Faster calculations in self_play.py

Pull Request - State: closed - Opened by bibidybop over 3 years ago

#171 - Struggling to get Ray working

Issue - State: open - Opened by SheldonCurtiss over 3 years ago
Labels: question

#170 - Target Value Offset

Issue - State: closed - Opened by dans-acc over 3 years ago - 1 comment

#169 - Optimization of some parameters for tictactoe.

Pull Request - State: open - Opened by AdrianAcala over 3 years ago - 4 comments

#168 - Add progress bar with speed and estimation

Pull Request - State: closed - Opened by LeoVS09 over 3 years ago

#167 - Scrabble implementation - How to include Player's rack observation

Issue - State: closed - Opened by nicolasnijssen over 3 years ago - 2 comments

#166 - Would it possible to write go game and chess game program?

Issue - State: closed - Opened by leqingli2000 over 3 years ago - 1 comment
Labels: enhancement

#165 - Dimensionality issue in continuous action space

Issue - State: closed - Opened by alik604 over 3 years ago - 1 comment

#164 - could it run without ray?

Issue - State: closed - Opened by hilberthu over 3 years ago - 2 comments
Labels: question

#163 - File not found ray distributed cluster worker node save checkpoint

Issue - State: open - Opened by saintpoida over 3 years ago - 3 comments

#162 - If I know the environment, is it better to train alphazero?

Issue - State: open - Opened by omgmax over 3 years ago - 1 comment
Labels: question

#161 - 2 players moving simultaneously

Issue - State: open - Opened by omgmax over 3 years ago - 2 comments
Labels: question

#160 - Adapting to RLLib

Issue - State: closed - Opened by SebastianBodza over 3 years ago - 1 comment
Labels: enhancement

#159 - Self-play very slow and inefficient on GPU (self.selfplay_on_gpu = True)

Issue - State: open - Opened by kevaday almost 4 years ago - 1 comment
Labels: enhancement

#158 - How to adapt Muzero to financial trading?

Issue - State: closed - Opened by Ray-0403 almost 4 years ago - 22 comments

#157 - RAM memory usage of SelfPlay.continuous_self_play() keeps growing

Issue - State: open - Opened by adalsteinnpals almost 4 years ago - 8 comments
Labels: help wanted

#156 - Input of representation function is 131 planes instead of 128 !!

Issue - State: closed - Opened by 2M-kotb almost 4 years ago

#155 - Training step always 0

Issue - State: open - Opened by yeekit24 almost 4 years ago - 2 comments

#153 - Reward computation based on next state instead of (state, action)

Issue - State: closed - Opened by FXDevailly almost 4 years ago - 3 comments
Labels: question

#152 - Interpreting the training curves on Tic Tac Toe

Issue - State: closed - Opened by itmorn almost 4 years ago - 13 comments
Labels: documentation

#151 - Keep replay buffer on disk (not in memory), allowing it to grow to any size.

Pull Request - State: open - Opened by me-unsolicited almost 4 years ago - 2 comments

#150 - [3090 rtx] Very slow training on resnet with 1 block

Issue - State: closed - Opened by HadiSDev almost 4 years ago - 5 comments

#149 - Render history

Pull Request - State: open - Opened by egafni almost 4 years ago

#144 - Why did I train lunarlander so slowly？more than 1 hour used 1 gpu

Issue - State: closed - Opened by Augustiu almost 4 years ago - 1 comment
Labels: question

#143 - Alpha Zero / MuZero differences

Issue - State: closed - Opened by Counterfeiter almost 4 years ago - 5 comments
Labels: enhancement, question

#142 - Updates on Reanalyse / Sample Efficiency (Re-executing MCTS, Parallelization, Stabilization with a target model, etc.)

Pull Request - State: open - Opened by FXDevailly almost 4 years ago - 6 comments

#141 - Only single step two player mode, no real two player mode!?

Issue - State: closed - Opened by Counterfeiter almost 4 years ago - 2 comments
Labels: enhancement, question

#139 - RuntimeError: cuDNN error: CUDNN_STATUS_NOT_INITIALIZED

Issue - State: closed - Opened by theword almost 4 years ago - 10 comments

#138 - Reason for not re-executing MCTS and updating policy "targets" (child visits) in "Reanalyze" ?

Issue - State: closed - Opened by FXDevailly almost 4 years ago - 9 comments
Labels: enhancement

#135 - Can training continue from a previous trained model?

Issue - State: closed - Opened by atalapan almost 4 years ago - 4 comments

#132 - Using TPUs

Issue - State: open - Opened by StrangeTcy about 4 years ago - 1 comment
Labels: enhancement

#130 - POMDP

Issue - State: closed - Opened by WenyuHan-LiNa about 4 years ago - 5 comments
Labels: question

#127 - Improving Atari hyperparameters

Issue - State: open - Opened by xiaolonghao about 4 years ago - 4 comments
Labels: question

#103 - Add open_spiel game wrapper

Pull Request - State: closed - Opened by goshawk22 about 4 years ago - 2 comments

#100 - Typo In muzero.py comments

Issue - State: closed - Opened by FrancescoVassalli about 4 years ago - 1 comment
