a3c
Here are 131 public repositories matching this topic...
-
Updated
May 29, 2020 - Python
I was surprised to see this loss function because it is generally used when the target is a distribution (i.e. sums to 1). This is not the case for the advantage estimate. However, I worked out the math and it does appear to be doing the right thing which is neat!
I think this trick should be mentioned in the code.
-
Updated
Jan 28, 2020 - Python
-
Updated
Mar 18, 2020 - Python
-
Updated
Sep 25, 2019 - Python
-
Updated
Feb 20, 2018 - Python
-
Updated
Apr 15, 2020 - Python
BTgym have two main sections, the Gym framework and the RL algorithm framework.
The RL part is tailored to the unique gym requirements of BTgym, but as new research in the field is emerging there will be a benefit in exploring new algorithms that aren't implemented by this project.
The following tutorial is my own attempt of testing the integration between the Gym part of BTgym with an externa
In tensorflow document, it says:
use_locking: If True, updating of the var, ms, and mom tensors is protected by a lock; otherwise the behavior is undefined, but may exhibit less contention.
However in the code this flag is set to False. Could this cause a problem by the racing condition?
Also, I don't understand why the original paper states it's better to share g across different thread
-
Updated
Nov 1, 2018 - Python
-
Updated
Jan 22, 2019 - Jupyter Notebook
-
Updated
Sep 4, 2018 - Python
-
Updated
Jun 1, 2020 - Java
-
Updated
May 31, 2020 - Python
-
Updated
Nov 8, 2017 - Python
-
Updated
Apr 20, 2020 - Python
-
Updated
Oct 10, 2018 - Python
-
Updated
Apr 7, 2018 - Python
-
Updated
May 4, 2020 - Python
-
Updated
Jan 6, 2020 - Python
-
Updated
Apr 3, 2017 - Python
-
Updated
Nov 11, 2019 - Python
-
Updated
Oct 25, 2018 - Python
-
Updated
Feb 27, 2019 - Jupyter Notebook
-
Updated
Jun 5, 2020 - Python
-
Updated
Jan 5, 2019 - Python
-
Updated
Nov 2, 2017 - Python
-
Updated
Mar 10, 2018 - Python
Improve this page
Add a description, image, and links to the a3c topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the a3c topic, visit your repo's landing page and select "manage topics."
I understand that these two python files show two different methods to construct a model. The original n_epoch is 500 which works perfect for both python files. But if I change n_epoch to 20, only tutorial_mnist_mlp_static.py can achieve a high test accuracy (~0.97). The other file tutorial_mnist_mlp_static_2.py only get 0.47.
The models built from these two files looks the same for me (the s