reinforcement-learning

Right now, grid search variables are resolved before the random samples are generated.

If we could toggle the order of resolution, we could support this: https://discuss.ray.io/t/is-there-a-way-to-run-the-same-hyperparameter-configuration-multiple-times/1412/12

Vcpkg is a C++ dependency management system that makes installation and consumption as a dependency very easy. We should support this for VW to allow consuming the lib as easy as possible.

Instructions for creating a new package can be found here: https://github.com/microsoft/vcpkg/blob/master/docs/examples/packaging-github-repos.md

Is there a way to train a bidirectional RNN (like LSTM or GRU) on trax nowadays?

The following applies to DDPG and TD3, and possibly other models. The following libraries were installed in a virtual environment:

numpy==1.16.4
stable-baselines==2.10.0
gym==0.14.0
tensorflow==1.14.0

Episode rewards do not seem to be updated in model.learn() before callback.on_step(). Depending on which callback.locals variable is used, this means that:

episode rewards may n

reinforcement-learning

Here are 6,238 public repositories matching this topic...

ray-project / ray

[tune] Support resolving grid search variables before random samples

Feature Request: Python API to get current Ray cluster information

[core] How to obtain memory occupied by object ref?

tensorflow / tensor2tensor

Unity-Technologies / ml-agents

ShangtongZhang / reinforcement-learning-an-introduction

ddbourgin / numpy-ml

eugeneyan / applied-ml

kmario23 / deep-learning-drizzle

Hvass-Labs / TensorFlow-Tutorials

bulletphysics / bullet3

VowpalWabbit / vowpal_wabbit

Create VCPKG package for VowpalWabbit

Allow multiple data files as input

deepmind / pysc2

tensorlayer / tensorlayer

MorvanZhou / Reinforcement-learning-with-tensorflow

owainlewis / awesome-artificial-intelligence

lazyprogrammer / machine_learning_examples

google / trax

Bidirectional RNN

tensorpack / tensorpack

MorvanZhou / PyTorch-Tutorial

aws / amazon-sagemaker-examples

keras-rl / keras-rl

yandexdataschool / Practical_RL

BinRoot / TensorFlow-Book

janhuenermann / neurojs

udacity / deep-reinforcement-learning

jason718 / awesome-self-supervised-learning

arXivTimes / arXivTimes

andri27-ts / Reinforcement-Learning

pytorch / ELF

astorfi / Deep-Learning-Roadmap

hill-a / stable-baselines

Episode rewards not updated before being used by callback.on_step()

Improve this page

Add this topic to your repo