peft

UI tool for fine-tuning and testing your own LoRA models base on LLaMA, GPT-J and more. One-click run on Google Colab. + A Gradio ChatGPT-like Chat UI to demonstrate your language models.

machine-learning ai llama gpt lora language-model alpaca peft google-colab gpt-j alpaca-lora

Updated May 29, 2023
Python

Guitaricet / relora

Star

Official code for ReLoRA from the paper Stack More Layers Differently: High-Rank Training Through Low-Rank Updates

nlp deep-learning transformer llama distributed-training peft

Updated Nov 8, 2023
Jupyter Notebook

Joyce94 / LLM-RLHF-Tuning

Star

LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)

reinforcement-learning llama lora language-model fine-tuning ppo peft llm rlhf

Updated Oct 11, 2023
Python

iamarunbrahma / finetuned-qlora-falcon7b-medical

Star

Finetuning of Falcon-7B LLM using QLoRA on Mental Health Conversational Dataset

chatbot falcon healthcare chatbots lora mental-health conversational-ai fine-tuning peft llm qlora falcon-7b

Updated Sep 3, 2023
Jupyter Notebook

mindspore-courses / step_into_llm

Star

MindSpore online courses: Step into LLM

Updated Nov 25, 2023
Python

jackaduma / Vicuna-LoRA-RLHF-PyTorch

Star

A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Vicuna architecture. Basically ChatGPT but with Vicuna

pytorch llama gpt lora finetune ppo peft vicuna llm chatgpt rlhf reward-models vicuna-7b

Updated Apr 28, 2023
Python

jianzhnie / open-chatgpt

Star

The open source implementation of ChatGPT, Alpaca, Vicuna and RLHF Pipeline. 从0开始实现一个ChatGPT.

llama gpt lora ppo peft llm chatgpt rlhf stanford-alpaca

Updated Jun 1, 2023
Python

jasonvanf / llama-trl

Star

LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA

adapter transformer llama gpt lora ppo peft trl gpt-4 chatgpt rlhf

Updated May 23, 2023
Python

jackaduma / ChatGLM-LoRA-RLHF-PyTorch

Star

A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the ChatGLM architecture. Basically ChatGPT but with ChatGLM

pytorch llama gpt lora finetune ppo peft deepspeed llm chatgpt rlhf reward-models chatglm chatglm-6b

Updated Apr 28, 2023
Python

calpt / awesome-adapter-resources

Star

Collection of Tools and Papers related to Adapters / Parameter-Efficient Transfer Learning/ Fine-Tuning

nlp natural-language-processing awesome deep-learning transformers adapters peft parameter-efficient-learning parameter-efficient-tuning

Updated Nov 11, 2023
Python

jackaduma / Alpaca-LoRA-RLHF-PyTorch

Star

A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Alpaca architecture. Basically ChatGPT but with Alpaca

pytorch llama gpt lora alpaca finetune ppo peft deepspeed llm chatgpt rlhf reward-models

Updated Apr 28, 2023
Python

Improve this page

Add a description, image, and links to the peft topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the peft topic, visit your repo's landing page and select "manage topics."

Learn more