# ucb
Here are 32 public repositories matching this topic...
Updated May 10, 2020 - HTML
Structure and Interpretation of Computer Programs
Updated Sep 4, 2019 - Python
arm
algorithm
reinforcement-learning
simulation
monte-carlo
rank
thompson-sampling
reinforcement-learning-algorithms
ucb
reward
multi-armed-bandit
montecarlo-simulation
contextual-bandits
ranking-algorithm
mab
ranked-mab
Updated Jul 23, 2019 - Python
All projects about ucb-61b (spring 2014), http://www.cs.berkeley.edu/~jrs/61b/index.html
Updated Apr 1, 2016 - Java
UCThello - a board game demonstrator (Othello variant) with computer AI using Monte Carlo Tree Search (MCTS) with UCB (Upper Confidence Bounds) applied to trees (UCT in short)
game
board-game
mobile
ai
simulation
mobile-app
artificial-intelligence
mcts
othello
mobile-game
entertainment
ucb
uct
monte-carlo-tree-search
ai-players
upper-confidence-bounds
abstract-game
perfect-information
2-player-strategy-game
Updated Mar 30, 2018 - JavaScript
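Several entries on this page mention UCB (Upper Confidence Bounds), either for bandits or applied to trees (UCT). As a rough illustration of the idea, here is a minimal sketch of the textbook UCB1 selection rule. It is not taken from any repository listed here; the function name `ucb1_select`, its parameters, and the default exploration constant are assumptions chosen for the example.

```python
import math


def ucb1_select(counts, values, c=math.sqrt(2)):
    """Return the index of the arm with the highest UCB1 score.

    counts[i] is how many times arm i has been played and values[i] is its
    mean observed reward. All names here are hypothetical, for illustration.
    """
    # Play every arm once first; this also avoids dividing by zero below.
    for arm, n in enumerate(counts):
        if n == 0:
            return arm

    total = sum(counts)
    # Score = mean reward + exploration bonus that shrinks as an arm is played more.
    scores = [
        values[arm] + c * math.sqrt(math.log(total) / counts[arm])
        for arm in range(len(counts))
    ]
    return max(range(len(scores)), key=scores.__getitem__)
```

In UCT the same score is typically computed per child node, with visit counts in place of play counts, to decide which branch of the search tree to descend into.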
A Julia package providing multi-armed bandit experiments
reinforcement-learning
julia
julia-language
thompson-sampling
reinforcement-learning-algorithms
multi-arm-bandits
ucb
julia-package
exp
julialang
mab
bandit-experiments
Updated Jul 19, 2018 - Julia
Implementations of basic concepts under the Reinforcement Learning umbrella. This project is a collection of assignments from CS747: Foundations of Intelligent and Learning Agents (Autumn 2017) at IIT Bombay.
reinforcement-learning
linear-programming
thompson-sampling
epsilon-greedy
ucb
policy-evaluation
mdps
multi-armed-bandits
policy-iteration
randomised-algorithms
reinforcement-learning-excercises
kl-divergence
markovian-epidemic-processes
reinforcement-learning-analysis
multiarm-bandit
ucb1
howards-pi
batch-switching
randomized-policy-iteration
Updated May 21, 2018 - Python
All projects about ucb-cs186 (fall 2013); more information is available on the course website (https://sites.google.com/site/cs186fall2013)
Updated May 24, 2016 - Java
Multi-armed bandit algorithm with TensorFlow and 11 policies
Updated May 1, 2019 - Python
Oware and Ouril - traditional African Mancala games with computer AI using Monte Carlo Tree Search (MCTS) with UCB (Upper Confidence Bounds) applied to trees (UCT in short)
game
board-game
mobile
ai
mobile-app
artificial-intelligence
mancala
oware
mcts
mobile-game
entertainment
ucb
uct
monte-carlo-tree-search
upper-confidence-bounds
abstract-game
perfect-information
2-player-strategy-game
mancala-game
ouril
Updated Mar 30, 2018 - HTML
Thompson Sampling for Bandits using UCB policy
Updated Jul 29, 2017 - Python
3-dimensional Four in a Row game with computer AI using Monte Carlo Tree Search (MCTS) with UCB (Upper Confidence Bounds) applied to trees (UCT in short).
game
board-game
mobile
ai
mobile-app
artificial-intelligence
mcts
mobile-game
entertainment
ucb
uct
monte-carlo-tree-search
upper-confidence-bounds
abstract-game
perfect-information
2-player-strategy-game
Updated Jan 29, 2018 - JavaScript
Alquerque - a traditional 2-player abstract strategy board game with perfect information and a computer AI option.
game
board-game
mobile
ai
mobile-app
artificial-intelligence
mcts
mobile-game
entertainment
ucb
uct
checkers
draughts
monte-carlo-tree-search
ai-players
upper-confidence-bounds
perfect-information
2-player-strategy-game
deterministic-game
Updated Mar 30, 2018 - JavaScript
AI for the game "Connect Four". Available on PyPI.
monte-carlo
artificial-intelligence
connect-four
puissance-4
tree-search
ucb
uct
ai-bots
monte-carlo-tree-search
ai-players
game-ai
upper-confidence-bounds
ai-opponents
connect4
artificial-intelligence-algorithms
puissance4
ai-agents
connect-4
game-artificial-intelligence
connect4-game
Updated Mar 3, 2019 - Python
Foundations Of Intelligent Learning Agents (FILA) Assignments
reinforcement-learning
monte-carlo
linear-programming
thompson-sampling
ucb
bootstrapping
multi-armed-bandits
bellman-equation
temporal-differencing-learning
howards-pi
sarsa-learning
kl-ucb
windy-gridworld
intelligent-learning-agents
Updated Nov 8, 2019 - Python
Chapter-wise implementation and analysis of all the algorithms in Reinforcement Learning: An Introduction by Richard S. Sutton and Andrew G. Barto
reinforcement-learning
artificial-intelligence
epsilon-greedy
python-3
ucb
k-armed-bandit
gradient-bandit
optimistic-inital-values
Updated Jun 22, 2020 - Jupyter Notebook
Complete Tutorial Guide with Code for learning ML
natural-language-processing
random-forest
svm
scikit-learn
artificial-neural-networks
logistic-regression
ucb
polynomial-regression
kmeans-clustering
knearest-neighbor-algorithm
apriori-algorithm
classification-methods
svr
kernel-svm
kernel-pca
heirarchical-clustering
decison-trees
Updated Jul 6, 2019 - Python
Implementation of 9 multi-armed bandit algorithms for a stationary stochastic environment
Updated Apr 24, 2020 - MATLAB
Codes and templates for ML algorithms created, modified and optimized in Python and R.
feature-selection
datascience
feature-extraction
thompson-sampling
dimensionality-reduction
ucb
ann
regression-models
nlp-machine-learning
kmeans-clustering
apriori-algorithm
hierarchical-clustering
classification-algorithims
parameter-tuning
regression-algorithms
xgboost-model
kfold-cross-validation
cnn-classification
eclat-algorithm
Updated Mar 28, 2020 - Python
A site for the Bengali Student Association club at UC Berkeley
Updated Oct 31, 2019 - Python
R.I.T project
Updated Jul 29, 2019 - Python