rllib
Here are 36 public repositories matching this topic...
-
Updated
Jun 2, 2021 - Python
-
Updated
Oct 23, 2020 - Python
-
Updated
May 21, 2021 - Jupyter Notebook
-
Updated
Jun 8, 2021 - Jupyter Notebook
-
Updated
Jun 19, 2021 - Jupyter Notebook
Currently we use a very ad-hoc procedure for scaling the quadratic component of NAF when used for exploration:
https://github.com/angelolovatto/raylab/blob/9820275b17ee085e1955a6d845c0bdf61333f8da/raylab/algorithms/naf/naf_policy.py#L150-L155
A possibly better alternative would be to scale it based on the desired average action stddev. Something like:
scale_tril * (1.0 / average_st-
Updated
Nov 17, 2020 - Python
-
Updated
May 29, 2021 - Jupyter Notebook
-
Updated
May 13, 2021 - Python
-
Updated
May 22, 2021 - Python
-
Updated
Jun 17, 2021 - Python
-
Updated
Dec 8, 2020 - Python
-
Updated
Sep 6, 2019 - Python
-
Updated
Jun 12, 2019 - Python
-
Updated
Jun 8, 2021 - Python
-
Updated
Dec 15, 2020 - Jupyter Notebook
-
Updated
Aug 25, 2020 - Jupyter Notebook
-
Updated
Nov 27, 2020 - Jupyter Notebook
-
Updated
Aug 11, 2020 - Python
-
Updated
Mar 25, 2021 - Jupyter Notebook
-
Updated
Sep 30, 2019 - Python
-
Updated
May 26, 2019 - Python
-
Updated
May 24, 2021 - Python
Implement parameterized linear and non-linear dynamics model with bimodal Gaussian noise.
-
Updated
Jun 17, 2021 - Python
Improve this page
Add a description, image, and links to the rllib topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the rllib topic, visit your repo's landing page and select "manage topics."
Trying out a simple example using TuneSearchCV with LGBMClassifier and it fails on start.
Environment:
Python 3.8.3
tune-sklearn 0.3.0
ray 1.3.0
macos mojave 10.14.6
Code: