Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / Stable-Baselines-Team/stable-baselines3-contrib issues and pull requests

#274 - Fix crash using SubprocVecEnv with MaskablePPO (#49)

Pull Request - State: closed - Opened by KiuIras 21 days ago - 1 comment

#273 - [Feature Request] Group Relative Proximity Optimization (GRPO)

Issue - State: open - Opened by Soham4001A 21 days ago - 2 comments
Labels: enhancement

#272 - GPRO - Feature Addition

Pull Request - State: open - Opened by Soham4001A 21 days ago

#271 - Release 2.5.0

Pull Request - State: closed - Opened by araffin 23 days ago

#270 - [Bug]: in "RecurrentPPO" not work "model.policy.evaluate_actions()"

Issue - State: open - Opened by drulye about 1 month ago - 2 comments
Labels: bug, more information needed, check the checklist

#269 - [Feature Request] Support for multi input policies in CrossQ

Issue - State: open - Opened by RaikoPipe about 1 month ago - 1 comment
Labels: enhancement

#268 - Added MultiInputPolicy support to CrossQ

Pull Request - State: open - Opened by RaikoPipe about 1 month ago

#267 - [Feature Request] ACERAC

Issue - State: open - Opened by lychanl 3 months ago - 2 comments
Labels: enhancement

#266 - Add policy documentation links to policy_kwargs parameter

Pull Request - State: closed - Opened by kplers 3 months ago

#265 - [Question] Not updating lstm states during training

Issue - State: open - Opened by abhinavj98 3 months ago - 1 comment
Labels: question

#264 - Add missing condition in CI

Pull Request - State: closed - Opened by araffin 3 months ago

#263 - Drop python 3.8, add python 3.12 support

Pull Request - State: closed - Opened by araffin 3 months ago

#262 - Release v2.4.0

Pull Request - State: closed - Opened by araffin 3 months ago

#261 - Add support for gymnasium v1.0

Pull Request - State: closed - Opened by araffin 4 months ago

#260 - Update deps for read the doc

Pull Request - State: closed - Opened by araffin 4 months ago

#259 - Fix QRDQN loading `target_update_interval`

Pull Request - State: closed - Opened by jak3122 5 months ago

#258 - [Bug]: loading QRDQN changes target_update_interval

Issue - State: closed - Opened by jak3122 5 months ago
Labels: bug

#257 - [Question] Why can't directly use the PPO (RecurrentActorCriticPolicy, "CartPole - v1", verbose = 1)

Issue - State: open - Opened by dajianer 6 months ago - 1 comment
Labels: question, more information needed, check the checklist

#255 - Fix warning when loading a `RecurrentPPO` model

Pull Request - State: closed - Opened by araffin 6 months ago

#254 - [Bug]: FutureWarning: You are using `torch.load` with `weights_only=False`

Issue - State: closed - Opened by drulye 6 months ago - 3 comments
Labels: bug, more information needed

#253 - [Feature Request] same random seed for every env in AsyncEval

Issue - State: open - Opened by 1-Bart-1 7 months ago - 1 comment
Labels: enhancement, check the checkboxes

#252 - Update QR-DQN optimizer to only use q_net parameters

Pull Request - State: closed - Opened by corentinlger 8 months ago - 1 comment

#251 - Update SB3 and remove gSDE resampling

Pull Request - State: closed - Opened by araffin 8 months ago

#250 - [Question] Masked actions PPO in multiagent setting using PettigZoo

Issue - State: open - Opened by MarcoPicione 8 months ago
Labels: question

#249 - [Question] Apply Masking using ActionMasker on composite actions

Issue - State: closed - Opened by mwalidcharrwi 8 months ago - 4 comments
Labels: duplicate, question, more information needed

#248 - [Question] How to do pre-training on the RecurrentPPO MlpLstmPolicy

Issue - State: open - Opened by iwishiwasaneagle 9 months ago - 1 comment
Labels: question

#247 - MaskablePPO Masking Doesn't Work with Big Action Space

Issue - State: closed - Opened by orkunkn 10 months ago - 4 comments
Labels: custom gym env, check the checklist

#246 - RecurrentActorCriticPolicy Behaviour Not Clear

Issue - State: open - Opened by pasinit 10 months ago - 1 comment
Labels: documentation

#245 - TQC: ep_len_mean and ep_rew_mean does not match real values

Issue - State: open - Opened by btabia 10 months ago
Labels: bug, custom gym env, check the checkboxes

#244 - ep_len_mean discrepancy

Issue - State: closed - Opened by btabia 10 months ago
Labels: custom gym env

#243 - Implemented CrossQ

Pull Request - State: closed - Opened by danielpalen 10 months ago - 11 comments

#242 - Dependent Actions in MultiDiscrete Action Space

Issue - State: open - Opened by bbarisbaturay 10 months ago - 5 comments
Labels: question

#241 - [Question] Recurrent Maskable PPO ?!? Rudder ?!?

Issue - State: closed - Opened by tty666 10 months ago - 1 comment
Labels: duplicate, question, trading warning

#239 - [Question] RecurrentPPO: Reset LSTM states early?

Issue - State: open - Opened by phisad 11 months ago - 3 comments
Labels: enhancement, question

#238 - [Feature Request] Implement CrossQ

Issue - State: closed - Opened by danielpalen 11 months ago
Labels: enhancement

#237 - Fix typo in changelog

Pull Request - State: closed - Opened by araffin 11 months ago

#236 - Release v2.3.0

Pull Request - State: closed - Opened by araffin 11 months ago

#235 - Log success rate for PPO variants

Pull Request - State: closed - Opened by araffin 11 months ago

#234 - [Question] Why does MaskablePPO does not mask with some logic with last observation?

Issue - State: open - Opened by EloyAnguiano 11 months ago - 4 comments
Labels: question

#233 - Fix PPO maskable type annotations

Pull Request - State: closed - Opened by araffin 12 months ago

#232 - Update ruff and SB3 dependencies

Pull Request - State: closed - Opened by araffin 12 months ago

#231 - [Question] Simple way to implement data augmentation when training agent

Issue - State: closed - Opened by thomashirtz 12 months ago - 2 comments
Labels: question

#230 - [Question] LSTM observations

Issue - State: closed - Opened by suargi about 1 year ago - 3 comments
Labels: question

#229 - Fix `train_freq` type annotation for TQC and QR-DQN

Pull Request - State: closed - Opened by Armandpl about 1 year ago

#228 - Episodic training with TQC?

Issue - State: closed - Opened by Armandpl about 1 year ago - 2 comments
Labels: enhancement, question

#227 - Add note about MaskableEvalCallback

Pull Request - State: closed - Opened by icheered about 1 year ago

#226 - EvalCallback crashes Maskable PPO without error

Issue - State: closed - Opened by icheered about 1 year ago - 3 comments
Labels: documentation, help wanted, custom gym env

#225 - Update QRDQN defaults

Pull Request - State: closed - Opened by araffin about 1 year ago

#223 - [Feature Request] STAC algorithm

Issue - State: open - Opened by EloyAnguiano about 1 year ago - 4 comments
Labels: enhancement

#222 - [Question] how to use "lstm_states" from rollout_buffer to reconstruct LSTM states during training

Issue - State: closed - Opened by DeepRowLie about 1 year ago - 2 comments
Labels: question

#221 - [Bug]: producing NAN values during training in MaskablePPO

Issue - State: open - Opened by vahidqo about 1 year ago - 5 comments
Labels: bug, more information needed, custom gym env, No tech support

#220 - [Feature Request] Expand RNN Options and Algorithm Flexibility

Issue - State: open - Opened by mtnusf97 about 1 year ago - 2 comments
Labels: enhancement

#219 - Update `_process_sequence()` docstring

Pull Request - State: closed - Opened by rogerioagjr about 1 year ago

#218 - [Question] Recurrent PPO evaluation

Issue - State: closed - Opened by CAI23sbP over 1 year ago - 2 comments
Labels: question

#217 - Release v2.2.1: hotfix file closing

Pull Request - State: closed - Opened by araffin over 1 year ago

#216 - Release v2.2.0

Pull Request - State: closed - Opened by araffin over 1 year ago

#215 - Remove PyType and upgrade to latest SB3 version

Pull Request - State: closed - Opened by araffin over 1 year ago

#214 - Add rollout_buffer_class to TRPO

Pull Request - State: closed - Opened by ernestum over 1 year ago - 2 comments

#213 - Sync SB3 Contrib with SB3

Pull Request - State: closed - Opened by araffin over 1 year ago

#212 - Predicting actions after using MaskablePPO model outputs invalid action

Issue - State: closed - Opened by vivek-kumar9696 over 1 year ago - 2 comments
Labels: duplicate, question, RTFM

#211 - Recurrent PPO Not Training Well on a Very Simple Environment

Issue - State: open - Opened by sreejank over 1 year ago - 1 comment
Labels: custom gym env, No tech support

#210 - Worse training with Vectorized Environment

Issue - State: closed - Opened by pklochowicz over 1 year ago
Labels: more information needed, custom gym env, No tech support

#209 - How to use LSTM ? RecurrentPPO from sb3-contrib

Issue - State: closed - Opened by PedroIAgithub over 1 year ago - 6 comments
Labels: question

#208 - Maskable PPO selects illegal actions, altough everything looks correct

Issue - State: closed - Opened by DominikRoB over 1 year ago - 2 comments
Labels: duplicate, question, more information needed

#207 - Decrease in reward during training with MaskablePPO

Issue - State: open - Opened by vahidqo over 1 year ago
Labels: question, more information needed, custom gym env

#206 - [Feature Request] BBF algorithm implementation

Issue - State: open - Opened by Alian3785 over 1 year ago - 2 comments
Labels: enhancement

#205 - Speed up when using MaskablePPO

Issue - State: open - Opened by vahidqo over 1 year ago - 2 comments
Labels: question

#204 - Release v2.1.0

Pull Request - State: closed - Opened by araffin over 1 year ago

#203 - SACD Discrete Soft Actor Critic

Pull Request - State: open - Opened by splatter96 over 1 year ago - 3 comments

#202 - [Feature Request] Hybrid PPO

Issue - State: open - Opened by AlexPasqua over 1 year ago - 3 comments
Labels: enhancement

#201 - [Feature Request] Implement Recurrent SAC

Issue - State: open - Opened by masterdezign over 1 year ago - 17 comments
Labels: enhancement

#200 - [Bug]: inappropriate actions despite the MaskablePPO applied

Issue - State: closed - Opened by koliber31 over 1 year ago - 1 comment
Labels: custom gym env, No tech support, check the checkboxes

#199 - Bugfix/ppo mask stats window size

Pull Request - State: closed - Opened by PatrickHelm over 1 year ago - 3 comments

#198 - [Bug]: MaskablePPO ignores stats_window_size argument

Issue - State: closed - Opened by PatrickHelm over 1 year ago - 2 comments
Labels: bug, help wanted

#197 - [Question] Action mask dimensions for action combinations in a MultiDiscrete space

Issue - State: closed - Opened by npit over 1 year ago - 2 comments
Labels: question

#196 - [Question] Example running error about PPO

Issue - State: closed - Opened by LoveingStatistics over 1 year ago - 3 comments
Labels: duplicate, question

#195 - Problems with MaskablePPO

Issue - State: open - Opened by koliber31 over 1 year ago - 16 comments
Labels: custom gym env

#194 - Drop python 3.7, add 3.11 and update github templates

Pull Request - State: closed - Opened by araffin over 1 year ago

#193 - [Question] Would you like a pull request implementing classical tabular RL algorithms ?

Issue - State: open - Opened by Butanium over 1 year ago - 1 comment
Labels: question

#192 - Release v2.0.0

Pull Request - State: closed - Opened by araffin over 1 year ago

#191 - [Question] What's the best way to store aditional data in transitions for an OffPolicyAlgorithm

Issue - State: closed - Opened by Butanium over 1 year ago - 6 comments
Labels: question

#190 - Update version and fix #188

Pull Request - State: closed - Opened by araffin over 1 year ago

#188 - Note for later: update build script

Issue - State: closed - Opened by araffin over 1 year ago
Labels: enhancement

#187 - [Question] what would I got if I manage the train like this in SubprocVecEnv?

Issue - State: closed - Opened by Pborz almost 2 years ago - 5 comments
Labels: question

#186 - Timestamp as observation

Issue - State: closed - Opened by AminDar almost 2 years ago - 2 comments
Labels: question, custom gym env

#185 - Update AsyncEval seeding

Pull Request - State: closed - Opened by araffin almost 2 years ago

#184 - seems that python3.10 not include all sb3_contrib yet

Issue - State: closed - Opened by Pborz almost 2 years ago - 2 comments
Labels: more information needed

#183 - Architecture of PPO LSTM

Issue - State: closed - Opened by anilkurkcu almost 2 years ago - 5 comments
Labels: question

#182 - Update doc: switch from Gym to Gymnasium

Pull Request - State: closed - Opened by araffin almost 2 years ago

#181 - Issue with PIP

Issue - State: closed - Opened by anilkurkcu almost 2 years ago - 1 comment
Labels: bug

#180 - [Feature Request] Domain Randomization

Issue - State: open - Opened by KonstantinRamthun almost 2 years ago - 2 comments
Labels: enhancement

#179 - Recurrent PPO

Issue - State: closed - Opened by fede72bari almost 2 years ago - 4 comments
Labels: bug, more information needed

#178 - [Feature Request] Maskable EvalCallback support

Issue - State: closed - Opened by DnzJS almost 2 years ago - 2 comments
Labels: enhancement

#177 - How to use maskable PPO

Issue - State: closed - Opened by anilkurkcu almost 2 years ago - 1 comment
Labels: duplicate, question, RTFM