Skip to content
🏠
Working from home
🏠
Working from home
Pro
Block or report user

Report or block ikostrikov

Hide content and notifications from this user.

Learn more about blocking users

Contact Support about this user’s behavior.

Learn more about reporting abuse

Report abuse

Organizations

@VisualComputingInstitute
Block or report user

Report or block ikostrikov

Hide content and notifications from this user.

Learn more about blocking users

Contact Support about this user’s behavior.

Learn more about reporting abuse

Report abuse

Pinned

  1. DrQ: Data regularized Q

    Jupyter Notebook 184 11

  2. PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKT…

    Python 1.9k 508

  3. PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".

    Python 773 203

  4. PyTorch implementations of algorithms for density estimation

    Python 398 43

  5. PyTorch implementation of Trust Region Policy Optimization

    Python 257 55

  6. A PyTorch implementation of Learning to learn by gradient descent by gradient descent

    Python 248 53

216 contributions in the last year

Jun Jul Aug Sep Oct Nov Dec Jan Feb Mar Apr May Mon Wed Fri
Activity overview
Loading

Contribution activity

May 2020

30 contributions in private repositories May 1 – May 14

Seeing something unexpected? Take a look at the GitHub profile guide.

You can’t perform that action at this time.