gym
Here are 609 public repositories matching this topic...
I'll post it as a question as I am not quite sure that it is a bug. I have been experimenting for a while with the library in a custom environment for a school project and I am really interested in the reproducibility of the result. I have read the disclaimer in the documentation that reads that reproducible results are not guaranteed across multiple platforms or different versions of Pytorch. Ho
I would love for Wger to become the central database that all FOSS projects pull workout information from. However, workout apps tend to use very different art styles — wger itself uses line art, openWorkout uses 3D animation, Feeel uses low-poly images.
It'd be great if volunteers could upload graphics in any o
-
Updated
Feb 6, 2021 - Python
-
Updated
Jan 25, 2021 - Python
-
Updated
Apr 25, 2021 - Python
Per this comment in #12
-
Updated
Jun 13, 2021 - Jupyter Notebook
-
Updated
Apr 5, 2021 - Python
Problem Description
Procgen Environments (https://github.com/openai/procgen) are new environments to test out the generalization ability of agents. It would be nice to include some of the games into the Open RL Benchmark (http://benchmark.cleanrl.dev/)
This is a good first issue for contributors. I think contributors can simply modify the network model slightly (https://github.com/vwxyzjn/c
-
Updated
Jul 14, 2019 - Python
-
Updated
May 31, 2020 - Python
-
Updated
Jul 20, 2021 - Python
-
Updated
Oct 1, 2020 - Python
There seem to be some vulnerabilities in our code that might fail easily. I suggest adding more unit tests for the following:
- Custom agents (there's only VPG and PPO on CartPole-v0 as of now. We should preferably add more to cover discrete-offpolicy, continuous-offpolicy and continuous-onpolicy)
- Evaluation for the Bandits and Classical agents
- Testing of convergence of agents as proposed i
-
Updated
May 8, 2020 - JavaScript
-
Updated
Jul 21, 2021 - Jupyter Notebook
-
Updated
Jul 13, 2021 - Python
-
Updated
Jul 20, 2021 - Python
-
Updated
Jul 4, 2019 - Python
-
Updated
Jul 12, 2021 - Python
-
Updated
Apr 16, 2017 - Python
-
Updated
Jun 29, 2021 - Python
-
Updated
Nov 6, 2020 - Python
-
Updated
Mar 12, 2021 - Jupyter Notebook
-
Updated
Jul 20, 2018 - Python
Improve this page
Add a description, image, and links to the gym topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the gym topic, visit your repo's landing page and select "manage topics."
The following applies to DDPG and TD3, and possibly other models. The following libraries were installed in a virtual environment:
numpy==1.16.4
stable-baselines==2.10.0
gym==0.14.0
tensorflow==1.14.0
Episode rewards do not seem to be updated in
model.learn()beforecallback.on_step(). Depending on whichcallback.localsvariable is used, this means that: