README.md

Based on bootleg AlphaZero.

This repository implements GRAB0 with two search approaches: GBTS (Gradient Bandit Tree Search), which is MCTS with Gradient Bandits, and SGBS (Stacked Gradient Bandit Search), which uses a stack of Gradient Bandits.
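For readers unfamiliar with the building block both approaches share, here is a minimal sketch of a single gradient bandit in its standard textbook form: a softmax policy over learned action preferences, updated with a running-average reward baseline. It is not taken from this repository's `src`; the class name `GradientBandit`, its method names, and the `step_size` default are assumptions made for illustration, and how GBTS and SGBS actually arrange such bandits is not shown here.

```python
import math
import random


class GradientBandit:
    """Textbook gradient bandit (hypothetical names, not this repo's API)."""

    def __init__(self, n_actions, step_size=0.1, seed=None):
        self.preferences = [0.0] * n_actions  # one preference H(a) per action
        self.step_size = step_size            # assumed learning rate
        self.baseline = 0.0                   # running average of past rewards
        self.steps = 0
        self.rng = random.Random(seed)

    def policy(self):
        """Softmax over preferences, shifted by the max for numerical stability."""
        m = max(self.preferences)
        exps = [math.exp(h - m) for h in self.preferences]
        total = sum(exps)
        return [e / total for e in exps]

    def select_action(self):
        """Sample an action index according to the current softmax policy."""
        probs = self.policy()
        return self.rng.choices(range(len(probs)), weights=probs)[0]

    def update(self, action, reward):
        """Raise the chosen action's preference if the reward beat the baseline."""
        probs = self.policy()
        # Baseline here is the average of rewards seen before this step.
        advantage = reward - self.baseline
        for a, p in enumerate(probs):
            indicator = 1.0 if a == action else 0.0
            self.preferences[a] += self.step_size * advantage * (indicator - p)
        # Fold the new reward into the running average afterwards.
        self.steps += 1
        self.baseline += (reward - self.baseline) / self.steps


if __name__ == "__main__":
    # Toy two-armed problem: action 1 always pays 1.0, action 0 pays 0.0.
    bandit = GradientBandit(n_actions=2, seed=0)
    for _ in range(2000):
        a = bandit.select_action()
        bandit.update(a, 1.0 if a == 1 else 0.0)
    print(bandit.policy())
```

On the toy problem in the `__main__` guard, the policy should shift its probability mass toward the rewarding action as updates accumulate.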

Below is SGBS in a nutshell:

Experimental results for GRAB0 can be found in results.md. A few more results have not been pushed yet; more information will follow here.

Below is the result of 33 training runs on CartPole using the GRAB0 approach in combination with SGBS.

Installation

Information will follow.

About

It GRABs you, it consumes you. What is GRAB doing you ask? It is learning. With 0 domain knowledge.
