Skip to content
#

convergence

Here are 65 public repositories matching this topic...

The goal of this project is to build an RL-based algorithm that can help cab drivers maximize their profits by improving their decision-making process on the field. Taking long-term profit as the goal, a method is proposed based on reinforcement learning to optimize taxi driving strategies for profit maximization. This optimization problem is formulated as a Markov Decision Process i.e. MDP.

  • Updated Jul 9, 2021
  • Jupyter Notebook

As part of this project, I have developed algorithms from scratch using Gradient Descent method. The first algorithm developed will be used to predict the average GPU Run Time and the second algorithm will be used to classify a GPU run process as high or low time consuming process.

  • Updated Aug 4, 2020
  • R

Improve this page

Add a description, image, and links to the convergence topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the convergence topic, visit your repo's landing page and select "manage topics."

Learn more