rllib
Here are 31 public repositories matching this topic...
-
Updated
Mar 27, 2021 - Python
-
Updated
Oct 23, 2020 - Python
-
Updated
Nov 25, 2020 - Jupyter Notebook
-
Updated
Mar 25, 2021 - Jupyter Notebook
Currently we use a very ad-hoc procedure for scaling the quadratic component of NAF when used for exploration:
https://github.com/angelolovatto/raylab/blob/9820275b17ee085e1955a6d845c0bdf61333f8da/raylab/algorithms/naf/naf_policy.py#L150-L155
A possibly better alternative would be to scale it based on the desired average action stddev. Something like:
scale_tril * (1.0 / average_st-
Updated
Apr 3, 2021 - Jupyter Notebook
-
Updated
Nov 17, 2020 - Python
-
Updated
Aug 30, 2020 - Jupyter Notebook
-
Updated
Dec 8, 2020 - Python
-
Updated
Aug 4, 2020 - Python
-
Updated
Mar 17, 2021 - Python
-
Updated
Sep 6, 2019 - Python
-
Updated
Jun 12, 2019 - Python
-
Updated
Aug 25, 2020 - Jupyter Notebook
-
Updated
Nov 27, 2020 - Jupyter Notebook
-
Updated
Mar 26, 2021 - Python
-
Updated
Mar 25, 2021 - Jupyter Notebook
-
Updated
Aug 11, 2020 - Python
Implement parameterized linear and non-linear dynamics model with bimodal Gaussian noise.
-
Updated
May 26, 2019 - Python
-
Updated
Jan 22, 2021 - Python
-
Updated
Sep 30, 2019 - Python
-
Updated
Dec 15, 2020 - Jupyter Notebook
-
Updated
Jan 14, 2020 - Python
-
Updated
Dec 16, 2020 - Python
Improve this page
Add a description, image, and links to the rllib topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the rllib topic, visit your repo's landing page and select "manage topics."
Right now, grid search variables are resolved before the random samples are generated.
If we could toggle the order of resolution, we could support this: https://discuss.ray.io/t/is-there-a-way-to-run-the-same-hyperparameter-configuration-multiple-times/1412/12