Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / werner-duvaud/muzero-general issues and pull requests
#233 - Api integration & won games statistic
Pull Request -
State: closed - Opened by slimyjimmy 15 days ago
#232 - Bump actions/download-artifact from 2 to 4.1.7 in /.github/workflows
Pull Request -
State: open - Opened by dependabot[bot] 6 months ago
Labels: dependencies
#231 - Chess and other non-trivial games
Issue -
State: open - Opened by StepHaze about 1 year ago
- 2 comments
Labels: enhancement
#230 - Update game logic and error handling in spiel.py
Pull Request -
State: open - Opened by leoni-q about 1 year ago
#229 - cross play between models
Issue -
State: open - Opened by dmtrung14 about 1 year ago
Labels: enhancement
#228 - What does the `replay_buffer.pkl` do?
Issue -
State: closed - Opened by dmtrung14 about 1 year ago
- 1 comment
#227 - How can I use a pre-trained model?
Issue -
State: open - Opened by worldsoft over 1 year ago
- 1 comment
Labels: enhancement
#226 - MuZero choose the same action
Issue -
State: open - Opened by sdumi03 over 1 year ago
Labels: bug
#225 - Muzero crashes when choosing spiel/backgammon
Issue -
State: open - Opened by artshar over 1 year ago
Labels: bug
#224 - Fix: gym v26 migration
Pull Request -
State: open - Opened by lduchosal over 1 year ago
#223 - Breakout: ModuleNotFoundError shows gym[atari] instead of python-opencv
Issue -
State: open - Opened by velicanerdem over 1 year ago
- 1 comment
Labels: bug
#222 - question about action encoded
Issue -
State: open - Opened by Nightbringers almost 2 years ago
Labels: bug
#221 - The difference between offical pseudo code and this repository about "num_unroll_steps"
Issue -
State: open - Opened by ZF4444 almost 2 years ago
Labels: bug
#220 - Dirichlet noise added outside of training
Issue -
State: open - Opened by TommyX12 almost 2 years ago
Labels: bug
#219 - Update gym package to gymnasium
Issue -
State: open - Opened by Mlokos almost 2 years ago
- 1 comment
Labels: enhancement
#218 - fix: fixed ray's error 'No module named aiohttp.signals'
Pull Request -
State: open - Opened by ChunchangShao about 2 years ago
#217 - Without Selfplay, why 1 games on a running??
Issue -
State: open - Opened by dlrlfkr11 about 2 years ago
- 2 comments
Labels: enhancement
#216 - [WIP] Sampled Muzero
Pull Request -
State: open - Opened by JosephDenman about 2 years ago
- 1 comment
#215 - Mean_value plot in Total_reward - Interpretation
Issue -
State: open - Opened by SunilaAkbar about 2 years ago
#214 - Question about the dimension of value and reward network
Issue -
State: open - Opened by jiachengc about 2 years ago
Labels: bug
#213 - Can't train using GPU? The torch version for this environment is '1.10.0cpu', that is, CPU one.
Issue -
State: open - Opened by SunilaAkbar over 2 years ago
- 2 comments
Labels: enhancement
#212 - Question about the perspective transformation of two players when calculating Q?
Issue -
State: open - Opened by puyuan1996 over 2 years ago
#211 - The model does not converge for breakout
Issue -
State: open - Opened by yungangwu over 2 years ago
- 13 comments
Labels: enhancement
#210 - Only One Player: Can we use MuZero?
Issue -
State: open - Opened by 1121091694 over 2 years ago
- 2 comments
Labels: enhancement
#209 - Switch architecture to parallel sync ray tasks
Pull Request -
State: closed - Opened by cmarlin over 2 years ago
#208 - TypeError: can't pickle function objects
Issue -
State: open - Opened by OopsYouDiedE over 2 years ago
Labels: bug
#207 - Question: Does muzero-general support 2 player games with simultaneous action selection?
Issue -
State: open - Opened by moscoso over 2 years ago
- 3 comments
#206 - Uncertainty pls
Pull Request -
State: closed - Opened by Dirichi over 2 years ago
- 1 comment
#205 - Small ensemble
Pull Request -
State: closed - Opened by Dirichi over 2 years ago
- 1 comment
#204 - Consistency
Pull Request -
State: closed - Opened by Dirichi over 2 years ago
#203 - Remove Batch Norm?
Issue -
State: open - Opened by verbose-void over 2 years ago
Labels: enhancement
#202 - OpenGL rendering on a remote server over X11
Issue -
State: open - Opened by jrjbertram over 2 years ago
Labels: bug
#201 - [Question] Is the environment required to have no hidden information?
Issue -
State: closed - Opened by ZhengWenZhang over 2 years ago
#200 - sampling in continuous/complex action spaces with 'density prior' is not working
Issue -
State: open - Opened by ManorZ over 2 years ago
Labels: bug
#199 - Batch MCTS
Issue -
State: open - Opened by szrlee over 2 years ago
Labels: enhancement
#198 - raw install has ray problem
Issue -
State: open - Opened by EngrStudent over 2 years ago
- 1 comment
Labels: bug
#197 - Render model
Issue -
State: open - Opened by theeduardomora over 2 years ago
- 1 comment
Labels: bug
#196 - Is there a place we can see and share results for each game?
Issue -
State: open - Opened by onegigbyte over 2 years ago
#195 - custom observation transformation
Issue -
State: open - Opened by SimpleMathmatics over 2 years ago
#194 - Fix requirements in ci
Pull Request -
State: closed - Opened by werner-duvaud over 2 years ago
Labels: bug
#193 - Policy target after MCTS should be in form of probabilities
Issue -
State: open - Opened by 2M-kotb over 2 years ago
- 1 comment
#192 - my first commit
Pull Request -
State: closed - Opened by aliigii almost 3 years ago
#191 - Sampled MuZero implementation
Issue -
State: open - Opened by matthiaskiller almost 3 years ago
- 1 comment
Labels: enhancement
#190 - fix a bug due to the use of DataParallel
Pull Request -
State: closed - Opened by vincentzhang almost 3 years ago
#189 - Can muzero learn to play two different games at the same time
Issue -
State: open - Opened by lwaif almost 3 years ago
- 1 comment
Labels: enhancement
#188 - Add workflow
Pull Request -
State: closed - Opened by ahainaut almost 3 years ago
#187 - Add github workflow
Pull Request -
State: closed - Opened by ahainaut almost 3 years ago
#186 - Strange observations and actions in continuous implementation.
Issue -
State: closed - Opened by dylanamiller almost 3 years ago
- 1 comment
#185 - MuZero Unplugged
Issue -
State: open - Opened by tbskrpmnns almost 3 years ago
- 7 comments
Labels: enhancement, question
#184 - Why is root.visit_count initialized to 0 and root_predicted_value not included in root node value?
Issue -
State: open - Opened by dniku almost 3 years ago
Labels: enhancement
#183 - procgen
Issue -
State: open - Opened by hlsfin about 3 years ago
Labels: enhancement, question
#182 - ray.exceptions.RayActorError: The actor died unexpectedly before finishing this task.
Issue -
State: open - Opened by hairinwind about 3 years ago
- 1 comment
#181 - Why my game does not remember the steps trained
Issue -
State: closed - Opened by hairinwind about 3 years ago
- 3 comments
#180 - training result cannot be loaded on another machine
Issue -
State: open - Opened by hairinwind about 3 years ago
- 1 comment
#178 - Scaling of historical stacked observations
Issue -
State: closed - Opened by tuero about 3 years ago
- 1 comment
#177 - added support for multiple dimension continuous action spaces
Pull Request -
State: open - Opened by devin-m-NRL over 3 years ago
- 1 comment
#176 - Total Training Reward rises then drops again
Issue -
State: closed - Opened by annahambi over 3 years ago
- 2 comments
Labels: question
#175 - Entropy loss in continuous actions
Issue -
State: closed - Opened by 2M-kotb over 3 years ago
- 4 comments
#174 - RuntimeError: module must have its parameters and buffers on device cuda:0 (device_ids[0]) but found one of them on device: cpu
Issue -
State: open - Opened by lukaszkn over 3 years ago
- 3 comments
#173 - Is there a more optimized way to complete training? Every game has to be learned for so long, people learn to master it early
Issue -
State: closed - Opened by lwaif over 3 years ago
- 1 comment
Labels: question
#172 - Faster calculations in self_play.py
Pull Request -
State: closed - Opened by bibidybop over 3 years ago
#171 - Struggling to get Ray working
Issue -
State: open - Opened by SheldonCurtiss over 3 years ago
Labels: question
#170 - Target Value Offset
Issue -
State: closed - Opened by dans-acc over 3 years ago
- 1 comment
#169 - Optimization of some parameters for tictactoe.
Pull Request -
State: open - Opened by AdrianAcala over 3 years ago
- 4 comments
#168 - Add progress bar with speed and estimation
Pull Request -
State: closed - Opened by LeoVS09 over 3 years ago
#167 - Scrabble implementation - How to include Player's rack observation
Issue -
State: closed - Opened by nicolasnijssen over 3 years ago
- 2 comments
#166 - Would it possible to write go game and chess game program?
Issue -
State: closed - Opened by leqingli2000 over 3 years ago
- 1 comment
Labels: enhancement
#165 - Dimensionality issue in continuous action space
Issue -
State: closed - Opened by alik604 over 3 years ago
- 1 comment
#164 - could it run without ray?
Issue -
State: closed - Opened by hilberthu over 3 years ago
- 2 comments
Labels: question
#163 - File not found ray distributed cluster worker node save checkpoint
Issue -
State: open - Opened by saintpoida over 3 years ago
- 3 comments
#162 - If I know the environment, is it better to train alphazero?
Issue -
State: open - Opened by omgmax over 3 years ago
- 1 comment
Labels: question
#161 - 2 players moving simultaneously
Issue -
State: open - Opened by omgmax over 3 years ago
- 2 comments
Labels: question
#160 - Adapting to RLLib
Issue -
State: closed - Opened by SebastianBodza over 3 years ago
- 1 comment
Labels: enhancement
#159 - Self-play very slow and inefficient on GPU (self.selfplay_on_gpu = True)
Issue -
State: open - Opened by kevaday almost 4 years ago
- 1 comment
Labels: enhancement
#158 - How to adapt Muzero to financial trading?
Issue -
State: closed - Opened by Ray-0403 almost 4 years ago
- 22 comments
#157 - RAM memory usage of SelfPlay.continuous_self_play() keeps growing
Issue -
State: open - Opened by adalsteinnpals almost 4 years ago
- 8 comments
Labels: help wanted
#156 - Input of representation function is 131 planes instead of 128 !!
Issue -
State: closed - Opened by 2M-kotb almost 4 years ago
#155 - Training step always 0
Issue -
State: open - Opened by yeekit24 almost 4 years ago
- 2 comments
#153 - Reward computation based on next state instead of (state, action)
Issue -
State: closed - Opened by FXDevailly almost 4 years ago
- 3 comments
Labels: question
#152 - Interpreting the training curves on Tic Tac Toe
Issue -
State: closed - Opened by itmorn almost 4 years ago
- 13 comments
Labels: documentation
#151 - Keep replay buffer on disk (not in memory), allowing it to grow to any size.
Pull Request -
State: open - Opened by me-unsolicited almost 4 years ago
- 2 comments
#150 - [3090 rtx] Very slow training on resnet with 1 block
Issue -
State: closed - Opened by HadiSDev almost 4 years ago
- 5 comments
#149 - Render history
Pull Request -
State: open - Opened by egafni almost 4 years ago
#144 - Why did I train lunarlander so slowly?more than 1 hour used 1 gpu
Issue -
State: closed - Opened by Augustiu almost 4 years ago
- 1 comment
Labels: question
#143 - Alpha Zero / MuZero differences
Issue -
State: closed - Opened by Counterfeiter almost 4 years ago
- 5 comments
Labels: enhancement, question
#142 - Updates on Reanalyse / Sample Efficiency (Re-executing MCTS, Parallelization, Stabilization with a target model, etc.)
Pull Request -
State: open - Opened by FXDevailly almost 4 years ago
- 6 comments
#141 - Only single step two player mode, no real two player mode!?
Issue -
State: closed - Opened by Counterfeiter almost 4 years ago
- 2 comments
Labels: enhancement, question
#139 - RuntimeError: cuDNN error: CUDNN_STATUS_NOT_INITIALIZED
Issue -
State: closed - Opened by theword almost 4 years ago
- 10 comments
#138 - Reason for not re-executing MCTS and updating policy "targets" (child visits) in "Reanalyze" ?
Issue -
State: closed - Opened by FXDevailly almost 4 years ago
- 9 comments
Labels: enhancement
#135 - Can training continue from a previous trained model?
Issue -
State: closed - Opened by atalapan almost 4 years ago
- 4 comments
#132 - Using TPUs
Issue -
State: open - Opened by StrangeTcy about 4 years ago
- 1 comment
Labels: enhancement
#130 - POMDP
Issue -
State: closed - Opened by WenyuHan-LiNa about 4 years ago
- 5 comments
Labels: question
#127 - Improving Atari hyperparameters
Issue -
State: open - Opened by xiaolonghao about 4 years ago
- 4 comments
Labels: question
#103 - Add open_spiel game wrapper
Pull Request -
State: closed - Opened by goshawk22 about 4 years ago
- 2 comments
#100 - Typo In muzero.py comments
Issue -
State: closed - Opened by FrancescoVassalli about 4 years ago
- 1 comment
#99 - Cartpole performance very slow
Issue -
State: closed - Opened by jarlva about 4 years ago
- 1 comment
#98 - windows TypeError: render() got an unexpected keyword argument 'format'
Issue -
State: closed - Opened by mikelty about 4 years ago
- 2 comments
#97 - When choose atari game and train , Display error global_worker
Issue -
State: closed - Opened by angpao about 4 years ago
- 1 comment
#96 - cartpole train
Issue -
State: closed - Opened by QilongPan about 4 years ago
- 1 comment
#95 - Loss not converging
Issue -
State: closed - Opened by jl1990 about 4 years ago
- 4 comments