Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Code for Deep RL from Human Preferences [Christiano et al]. Plus a webapp for collecting human feedback
Python 425 76
Contest Proposal and infrastructure for the Unrestricted Adversarial Examples Challenge
Python 307 60
An adversarial example library for constructing attacks, building defenses, and benchmarking both
Jupyter Notebook 5k 1.2k
A toolkit for developing and comparing reinforcement learning algorithms.
Python 23.7k 6.8k
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Python 11.3k 3.9k
Universe: a software platform for measuring and training an AI's general intelligence across the world's supply of games, websites and other applications.
Python 7.3k 895
Seeing something unexpected? Take a look at the GitHub profile guide.