rllib
Here are 29 public repositories matching this topic...
Currently we use a very ad-hoc procedure for scaling the quadratic component of NAF when used for exploration:
https://github.com/angelolovatto/raylab/blob/9820275b17ee085e1955a6d845c0bdf61333f8da/raylab/algorithms/naf/naf_policy.py#L150-L155
A possibly better alternative would be to scale it based on the desired average action stddev. Something like:
scale_tril * (1.0 / average_st…
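The snippet above is cut off, so the following is only a rough sketch of one way to read the idea, not the raylab implementation. It assumes scale_tril is the lower-triangular factor L of the exploration covariance (covariance = L Lᵀ), so that the stddev of action dimension i is the norm of row i of L. The function name rescale_scale_tril and the parameter target_stddev are hypothetical.

```python
import math


def rescale_scale_tril(scale_tril, target_stddev):
    """Rescale a lower-triangular factor so that the mean per-dimension
    stddev equals target_stddev (hypothetical helper, illustration only)."""
    # With covariance = L @ L.T, the variance of dimension i is the squared
    # Euclidean norm of row i of L, so its stddev is that row's norm.
    stddevs = [math.sqrt(sum(x * x for x in row)) for row in scale_tril]
    average_stddev = sum(stddevs) / len(stddevs)
    if average_stddev == 0.0:
        raise ValueError("cannot rescale a zero scale_tril")
    # Multiplying L by a scalar multiplies every stddev by the same scalar.
    factor = target_stddev / average_stddev
    return [[x * factor for x in row] for row in scale_tril]


scale_tril = [[2.0, 0.0], [3.0, 4.0]]  # row norms: 2.0 and 5.0
rescaled = rescale_scale_tril(scale_tril, target_stddev=0.7)
```

Because the factor is a single scalar, the relative shape of the learned covariance is preserved and only its overall magnitude changes.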
Implement parameterized linear and non-linear dynamics model with bimodal Gaussian noise.
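The request above gives no further detail, so the following is a minimal sketch under stated assumptions rather than the project's design. It assumes the noise is additive, drawn independently per state dimension from a two-component Gaussian mixture. All names (linear_dynamics, noisy_step, modes, weight) are hypothetical.

```python
import random


def linear_dynamics(A, B):
    """Return f(state, action) = A @ state + B @ action for nested-list matrices."""
    def f(state, action):
        return [
            sum(a * s for a, s in zip(row_a, state))
            + sum(b * u for b, u in zip(row_b, action))
            for row_a, row_b in zip(A, B)
        ]
    return f


def noisy_step(state, action, dynamics, rng, modes, weight=0.5):
    """Apply `dynamics`, then add bimodal Gaussian noise to each dimension.

    `modes` is a pair of (mean, stddev) tuples; the first mode is chosen
    with probability `weight`, the second otherwise.
    """
    next_state = []
    for value in dynamics(state, action):
        loc, scale = modes[0] if rng.random() < weight else modes[1]
        next_state.append(value + rng.gauss(loc, scale))
    return next_state


rng = random.Random(0)
step_fn = linear_dynamics(A=[[1.0, 0.0], [0.0, 1.0]], B=[[0.5], [0.0]])
next_state = noisy_step(
    [1.0, 2.0], [2.0], step_fn, rng, modes=((-1.0, 0.1), (1.0, 0.1))
)
```

Since noisy_step accepts any callable for the deterministic part, a non-linear model can be passed in place of linear_dynamics without changing the noise code.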
Describe your feature request
Hi guys,
It would be awesome to add an API that returns the same output as the ray memory command.
Also, it would be good to add some additional output info for ray.objects(): for example, the node IP, the IDs of objects created in in-process stores, and the IDs of objects from remote calls that are still executing.
Thanks in advance!