reinforcement-learning
Here are 6,238 public repositories matching this topic...
-
Updated
Apr 12, 2021 - Python
-
Updated
Apr 13, 2021 - C#
-
Updated
Apr 6, 2021 - Python
-
Updated
Apr 5, 2021 - Python
-
Updated
Apr 11, 2021
-
Updated
Mar 12, 2021
-
Updated
Jan 15, 2021 - Jupyter Notebook
-
Updated
Apr 10, 2021 - C++
Vcpkg is a C++ dependency management system that makes installation and consumption as a dependency very easy. We should support this for VW to allow consuming the lib as easy as possible.
Instructions for creating a new package can be found here: https://github.com/microsoft/vcpkg/blob/master/docs/examples/packaging-github-repos.md
-
Updated
Jul 29, 2020 - Python
-
Updated
Mar 17, 2021 - Python
-
Updated
Nov 1, 2020 - Python
-
Updated
Mar 25, 2021
-
Updated
Apr 3, 2021 - Python
Bidirectional RNN
Is there a way to train a bidirectional RNN (like LSTM or GRU) on trax nowadays?
-
Updated
Mar 2, 2021 - Python
-
Updated
Apr 1, 2021 - Jupyter Notebook
-
Updated
Nov 20, 2020 - Python
-
Updated
Mar 30, 2021 - Jupyter Notebook
-
Updated
Dec 14, 2019 - Jupyter Notebook
-
Updated
Mar 18, 2020 - JavaScript
-
Updated
Apr 8, 2021 - Jupyter Notebook
-
Updated
Mar 30, 2021
-
Updated
Apr 10, 2021
-
Updated
Jun 30, 2020 - Jupyter Notebook
-
Updated
Jun 21, 2019 - C++
-
Updated
Sep 10, 2020 - Python
The following applies to DDPG and TD3, and possibly other models. The following libraries were installed in a virtual environment:
numpy==1.16.4
stable-baselines==2.10.0
gym==0.14.0
tensorflow==1.14.0
Episode rewards do not seem to be updated in model.learn() before callback.on_step(). Depending on which callback.locals variable is used, this means that:
- episode rewards may n
Improve this page
Add a description, image, and links to the reinforcement-learning topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the reinforcement-learning topic, visit your repo's landing page and select "manage topics."
Right now, grid search variables are resolved before the random samples are generated.
If we could toggle the order of resolution, we could support this: https://discuss.ray.io/t/is-there-a-way-to-run-the-same-hyperparameter-configuration-multiple-times/1412/12