Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / opendilab/DI-engine issues and pull requests
#857 - feature(nyz&dcy): add LLM/VLM RLHF loss (PPO/GRPO/RLOO)
Pull Request -
State: open - Opened by PaParaZz1 8 days ago
- 1 comment
Labels: enhancement, algo
#856 - feature(wqj): add vllm collector
Pull Request -
State: open - Opened by wqj2004 14 days ago
#855 - polish(pu): delete unused enable_fast_timestep argument
Pull Request -
State: closed - Opened by puyuan1996 25 days ago
Labels: config
#854 - feature(nyz): add rlhf dataset
Pull Request -
State: closed - Opened by PaParaZz1 28 days ago
Labels: enhancement, data
#853 - feature(wqj): add vllm rlhf collector
Pull Request -
State: closed - Opened by wqj2004 about 1 month ago
Labels: enhancement
#852 - AttributeError: 'MultiDiscrete' object has no attribute 'low'
Issue -
State: closed - Opened by Rocky-CN about 1 month ago
- 3 comments
Labels: bug, env
#851 - enable_fast_timestep
Issue -
State: closed - Opened by neal2164 about 2 months ago
- 2 comments
Labels: algo
#850 - Noisy Net Issue
Issue -
State: open - Opened by neal2164 2 months ago
- 2 comments
Labels: bug, algo
#849 - LSTM Layer normalization
Issue -
State: closed - Opened by neal2164 2 months ago
- 2 comments
Labels: algo
#849 - LSTM Layer normalization
Issue -
State: closed - Opened by neal2164 2 months ago
- 2 comments
Labels: algo
#848 - Mistake
Issue -
State: closed - Opened by perfectconan 2 months ago
#848 - Mistake
Issue -
State: closed - Opened by perfectconan 2 months ago
#847 - Epsilon-Greedy Exploration in Hybrid DDPG
Issue -
State: closed - Opened by MarkHolmstrom 3 months ago
- 2 comments
Labels: algo
#846 - Bug in reset method
Issue -
State: closed - Opened by neal2164 3 months ago
- 1 comment
Labels: bug, algo
#845 - Reset env_info
Issue -
State: closed - Opened by neal2164 3 months ago
- 1 comment
Labels: bug, env
#844 - Priority Experience Replay Bug
Issue -
State: closed - Opened by neal2164 3 months ago
- 1 comment
Labels: bug, data
#843 - Issue with R2D2
Issue -
State: closed - Opened by neal2164 3 months ago
#842 - feature(pu): add ddp config of dqn and onppo
Pull Request -
State: closed - Opened by puyuan1996 3 months ago
Labels: efficiency optimization, config
#841 - feature(xyy):add HPT model to implement PolicyStem+DuelingHead
Pull Request -
State: closed - Opened by luodi-7 3 months ago
Labels: algo
#840 - :feature(xyy):add HPT model to implement PolicyStem+DuelingHead
Pull Request -
State: closed - Opened by luodi-7 3 months ago
#839 - add hpt model and corresponding examples.
Pull Request -
State: closed - Opened by luodi-7 3 months ago
#838 - conda安装di-engine,无法与python 3.9兼容
Issue -
State: closed - Opened by Rocky-CN 3 months ago
- 3 comments
Labels: bug
#837 - AttributeError: 'NoneType' object has no attribute 'shape' when running simple_rl_train.py - Carla
Issue -
State: closed - Opened by tornado20092008 3 months ago
- 4 comments
Labels: env
#836 - Bugreport: 运行ptz_simple_spread_qmix_config.py会出错
Issue -
State: closed - Opened by agdlfksdhasdoi 3 months ago
- 3 comments
Labels: env
#835 - feature(pu): add resume_training option to allow the envstep and train_iter resume seamlessly
Pull Request -
State: closed - Opened by puyuan1996 4 months ago
Labels: config
#834 - gfootball no obs module
Issue -
State: closed - Opened by wenyu427 4 months ago
- 1 comment
Labels: env
#833 - feature(pu): add pistonball_env, its unittest and qmix config
Pull Request -
State: closed - Opened by puyuan1996 4 months ago
Labels: env, config
#832 - pettingzoo报错,并且只支持‘simple_spread_v2’。请问什么时候支持其他的呢
Issue -
State: closed - Opened by 670555467 5 months ago
- 7 comments
Labels: env
#831 - polish(TairanMK): update trading env
Pull Request -
State: closed - Opened by TairanMK 5 months ago
- 1 comment
#829 - polish(mark): add hybrid action space support to ActionNoiseWrapper
Pull Request -
State: closed - Opened by MarkHolmstrom 5 months ago
Labels: enhancement
#828 - feature(whl): add AWR algorithm.
Pull Request -
State: closed - Opened by kxzxvbk 5 months ago
- 1 comment
Labels: algo
#827 - I'm
Issue -
State: closed - Opened by Kindo96 5 months ago
- 2 comments
#826 - No Hidden Size List for ContinuousQAC?
Issue -
State: closed - Opened by MarkHolmstrom 6 months ago
- 1 comment
Labels: discussion, algo
#823 - Observation shape in the custom marl environment
Issue -
State: closed - Opened by WangJuan6 7 months ago
- 5 comments
Labels: discussion, env
#822 - Export multi-agent policies and shared training
Issue -
State: closed - Opened by ardian-selmonaj 7 months ago
- 1 comment
Labels: discussion, algo
#821 - feature(zjow): add Implicit Q-Learning
Pull Request -
State: closed - Opened by zjowowen 7 months ago
Labels: algo
#820 - Unexpected increase in memory overhead and Multi-GPU guidance in custom environments
Issue -
State: closed - Opened by WangJuan6 7 months ago
- 2 comments
Labels: discussion, parallel-dist
#818 - About Replicating the PPO Performance in the Hopper-V3 Environment
Issue -
State: closed - Opened by hyLiu1994 8 months ago
- 3 comments
Labels: env, config
#817 - feature(nyz): adapt DingEnvWrapper to gymnasium
Pull Request -
State: closed - Opened by PaParaZz1 8 months ago
- 2 comments
Labels: enhancement, env
#816 - How to initialize semantic model in subprocess mode
Issue -
State: closed - Opened by WangJuan6 8 months ago
- 2 comments
Labels: good first issue, env
#815 - How to define close function in custom environment
Issue -
State: closed - Opened by WangJuan6 8 months ago
- 3 comments
Labels: good first issue, env
#814 - RuntimeError: mat1 and mat2 shapes cannot be multiplied (4x138 and 64x138)
Issue -
State: closed - Opened by WangJuan6 8 months ago
- 3 comments
Labels: bug
#813 - How to introduce other optimizers into DI-engine?
Issue -
State: open - Opened by weidaolee 8 months ago
- 2 comments
Labels: good first issue, algo
#812 - KeyError: 'obs_shape'
Issue -
State: closed - Opened by WangJuan6 8 months ago
- 2 comments
Labels: bug, config
#811 - style(nyz): relax flask requirement
Pull Request -
State: closed - Opened by PaParaZz1 8 months ago
- 1 comment
Labels: test
#810 - No example for Trading Environments
Issue -
State: closed - Opened by eramosr16 8 months ago
- 1 comment
Labels: env, config
#809 - feature(zym): update ppo config to support discrete action space
Pull Request -
State: closed - Opened by YinminZhang 8 months ago
Labels: config
#808 - feature(wrh): add EDT code
Pull Request -
State: open - Opened by ruiheng123 8 months ago
Labels: algo
#807 - feature(wrh): add taxi env latest version and dqn config
Pull Request -
State: closed - Opened by ruiheng123 8 months ago
Labels: config
#806 - feature(wrh): add edt algorithm
Pull Request -
State: closed - Opened by ruiheng123 8 months ago
Labels: algo
#805 - style(hus): add new badge (hellogithub) in readme
Pull Request -
State: closed - Opened by TuTuHuss 8 months ago
Labels: doc
#803 - 马里奥代码咨询
Issue -
State: closed - Opened by wenxueliu 8 months ago
- 1 comment
Labels: discussion
#802 - feature(wrh): taxi_dqn_config.py update
Pull Request -
State: closed - Opened by ruiheng123 9 months ago
- 1 comment
Labels: config
#801 - polish(zym): optimize ppo continuous act
Pull Request -
State: closed - Opened by YinminZhang 9 months ago
Labels: config
#800 - gym_anytrading : could not broadcast input array from shape (62,) into shape (20,3) Please help!!
Issue -
State: closed - Opened by zhipentian 9 months ago
- 3 comments
Labels: bug, env
#799 - feature(wrh): add taxi env
Pull Request -
State: closed - Opened by ruiheng123 9 months ago
Labels: env
#798 - bug when running MARL algorithm Qmix in pettingzoo
Issue -
State: closed - Opened by cymmerida123 9 months ago
- 3 comments
Labels: bug
#797 - BrokenPipeError: [WinError 232] 管道正在被关闭 has occurred, when running MARL algorithm QMIX in pettingzoo
Issue -
State: closed - Opened by cymmerida123 9 months ago
#796 - cannot run GTrXL demo since v0.5.0
Issue -
State: closed - Opened by klcheungaj 9 months ago
- 1 comment
Labels: bug
#795 - doc(hus): update discord link and badge in readme
Pull Request -
State: closed - Opened by TuTuHuss 10 months ago
Labels: doc
#793 - docker内运行lunarlander_dqn_deploy失败
Issue -
State: closed - Opened by Eric-Zhao1 10 months ago
- 6 comments
Labels: bug, docker
#792 - question for SMAC
Issue -
State: closed - Opened by bihanbihan 10 months ago
- 3 comments
Labels: env
#791 - get "TypeError: __init__() got an unexpected keyword argument 'agent_obs_shape'" when running " python3 -u smac_5m6m_masac_config.py"
Issue -
State: closed - Opened by SiriusZbz 10 months ago
- 2 comments
Labels: bug
#790 - how to get the ckpt file?
Issue -
State: closed - Opened by SiriusZbz 10 months ago
- 2 comments
Labels: bug
#789 - TD3应用混合动作空间报错,AssertionError
Issue -
State: closed - Opened by dajianer 11 months ago
- 2 comments
Labels: bug
#788 - feature(nyz): add GPU utils
Pull Request -
State: closed - Opened by PaParaZz1 11 months ago
- 1 comment
Labels: efficiency optimization
#787 - 如何获取每个episode的reward值
Issue -
State: closed - Opened by dajianer 11 months ago
- 1 comment
Labels: discussion
#786 - fix(zjow): fix complex obs demo for ppo pipeline
Pull Request -
State: closed - Opened by zjowowen 11 months ago
- 1 comment
Labels: bug
#785 - fix(dajianer): add 'collect_kwargs' to the keep function of OnlineRLContext
Pull Request -
State: closed - Opened by dajianer 11 months ago
- 1 comment
Labels: bug
#784 - 混合动作空间环境,PPO使用gae_estimator报错
Issue -
State: closed - Opened by dajianer 11 months ago
- 3 comments
Labels: bug
#783 - feature(xrk): add q-transformer
Pull Request -
State: open - Opened by rongkunxue 11 months ago
Labels: algo
#782 - env(rjy): add ising model env
Pull Request -
State: closed - Opened by nighood 12 months ago
- 1 comment
Labels: env
#781 - feature(xrk): add new env named Flozen Lake and DQN algorithm.
Pull Request -
State: closed - Opened by rongkunxue 12 months ago
- 1 comment
Labels: env
#780 - feature(xrk): add new env named Flozen Lake and DQN algorithm.
Pull Request -
State: closed - Opened by rongkunxue 12 months ago
#779 - feature(xrk): add new env named Flozen Lake and DQN algorithm.
Pull Request -
State: closed - Opened by rongkunxue 12 months ago
Labels: env
#778 - feature(ooo): add deprecated function decorator
Pull Request -
State: closed - Opened by ooooo-create 12 months ago
- 2 comments
Labels: enhancement
#776 - fix(eltociear): typo in config.py
Pull Request -
State: closed - Opened by eltociear about 1 year ago
- 1 comment
Labels: typo
#775 - FQF logit computation
Issue -
State: closed - Opened by dmartinezbaselga about 1 year ago
- 3 comments
Labels: discussion, algo
#774 - feature(nyz): add MADDPG pettingzoo example
Pull Request -
State: closed - Opened by PaParaZz1 about 1 year ago
Labels: algo, config
#773 - Implementation of Mean-Field MARL algorithm
Issue -
State: closed - Opened by openRiemann about 1 year ago
- 3 comments
Labels: algo
#771 - feature(zc): add MetaDiffuser and prompt-dt
Pull Request -
State: open - Opened by Super1ce about 1 year ago
Labels: algo
#770 - record a video
Issue -
State: closed - Opened by zhixiongzh about 1 year ago
- 2 comments
#769 - gym soccer是否有文档? 其参数设置以及action的类型该如何写
Issue -
State: closed - Opened by Joylessss about 1 year ago
- 3 comments
Labels: env
#768 - polish(pu): polish comments in a2c/bcq/fqf/ibc policy
Pull Request -
State: closed - Opened by puyuan1996 about 1 year ago
- 1 comment
Labels: config, doc
#767 - polish(pu): polish NGU atari configs
Pull Request -
State: closed - Opened by puyuan1996 about 1 year ago
- 1 comment
Labels: config
#765 - 尝试使用自定义环境出现问题
Issue -
State: closed - Opened by HawkQ about 1 year ago
- 2 comments
Labels: env
#764 - polish(rjy): polish pg/iqn/edac policy doc
Pull Request -
State: closed - Opened by nighood about 1 year ago
Labels: doc
#763 - doc(zjow): polish the notation of classes and functions in torch_utils and utils
Pull Request -
State: closed - Opened by zjowowen about 1 year ago
- 1 comment
Labels: doc
#762 - doc(rjy): polish d4pg/ppg/qrdqn policy doc
Pull Request -
State: closed - Opened by nighood about 1 year ago
- 1 comment
Labels: doc
#761 - fix(pu): fix hppo entropy_weight to avoid nan error in log_prob
Pull Request -
State: closed - Opened by puyuan1996 about 1 year ago
- 1 comment
Labels: bug
#760 - H-PPO算法运行失败
Issue -
State: closed - Opened by Root970103 about 1 year ago
- 7 comments
Labels: bug
#759 - fix(zjow): fix bug in cliffwalking env
Pull Request -
State: closed - Opened by zjowowen about 1 year ago
Labels: bug
#758 - doc(zjow): add API doc for ding agent
Pull Request -
State: closed - Opened by zjowowen about 1 year ago
Labels: doc
#757 - feature(zjow): add qgpo policy for new DI-engine pipeline
Pull Request -
State: closed - Opened by zjowowen about 1 year ago
- 1 comment
Labels: algo
#755 - polish(rjy): polish the comments of collate_fn/profiler_helper/metric
Pull Request -
State: closed - Opened by nighood about 1 year ago
Labels: doc
#754 - feature(luyd): fix dt new pipeline of mujoco
Pull Request -
State: closed - Opened by AltmanD about 1 year ago
#753 - feature(zjow): add envpool new pipeline
Pull Request -
State: open - Opened by zjowowen about 1 year ago
Labels: enhancement
#752 - polish(rjy): polish comments in normalizer_helper and lock_helper
Pull Request -
State: closed - Opened by nighood over 1 year ago
Labels: doc
#751 - 代码报错:在配置好conda环境以及将该项目fork到本地后,在运行DI-engine/dizoo/petting_zoo/config/路径下的所有py文件(如ptz_simple_spread_madqn_config.py;ptz_simple_spread_mappo_config.py等)时均出现报错
Issue -
State: closed - Opened by QingYuanZi1024 over 1 year ago
- 3 comments
#750 - what algorithm do you use to sovle the overcooked problem? MADDPG?
Issue -
State: closed - Opened by frandoLin over 1 year ago
- 3 comments
Labels: discussion