Skip to content

PKU-Alignment

Explore
For
- Enterprise
- Teams
- Startups
- Education
By Solution
Case Studies
- Customer Stories
- Resources
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

PKU-Alignment

Loves Sharing and Open-Source, Making AI Safer.

92 followers
China
yaodong.yang@outlook.com

Overview
Repositories
Projects
Packages
People

More

Overview
Repositories
Projects
Packages
People

README.md

PKU-Alignment

Large language models (LLM) have immense potential in the field of general intelligence but come with significant risks. As a research team at Peking University, we are actively focusing on alignment techniques for large language models, such as safe-alignment to enhance the model's safety and reduce its toxicity.

Welcome to follow our AI Safety project:

safe-rlhf
omnisafe
safepo
safety-gymnasium

Pinned

omnisafe Public

OmniSafe is an infrastructural framework for accelerating SafeRL research.

Python 643 81
safety-gymnasium Public

Safety-Gymnasium is a highly scalable and customizable safe reinforcement learning environment library.

Python 132 15
safe-rlhf Public

Safe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Python 659 40
Safe-Policy-Optimization Public

This is a benchmark repository for safe reinforcement learning algorithms

Python 198 22

Repositories

Type

Select type

All Public Sources Forks Archived Mirrors Templates

Language

Select language

All Python

Sort

Select order

Last updated Name Stars

safe-rlhf Public
Safe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Python 659 Apache-2.0 40 4 1 Updated Jul 10, 2023
Awesome-LLM-Alignment Public

18 0 0 0 Updated Jul 10, 2023
safety-gymnasium Public
Safety-Gymnasium is a highly scalable and customizable safe reinforcement learning environment library.

Python 132 Apache-2.0 15 1 3 Updated Jul 9, 2023
omnisafe Public
OmniSafe is an infrastructural framework for accelerating SafeRL research.

Python 643 Apache-2.0 81 1 0 Updated Jul 6, 2023
Safe-Policy-Optimization Public
This is a benchmark repository for safe reinforcement learning algorithms

Python 198 Apache-2.0 22 0 0 Updated Jun 30, 2023
beavertails Public
BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).

12 Apache-2.0 0 0 0 Updated Jun 15, 2023
.github Public

0 0 0 0 Updated May 31, 2023
ReDMan Public
ReDMan is an open-source simulation platform that provides a standardized implementation of safe RL algorithms for Reliable Dexterous Manipulation.

Python 7 Apache-2.0 0 0 0 Updated May 2, 2023

People

Top languages

Loading…

Most used topics

reinforcement-learning safe-reinforcement-learning gpt llama safety

Footer

© 2023 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact GitHub
Pricing
API
Training
Blog
About

You can’t perform that action at this time.