- Fudan University
Highlights
- Pro
Pinned
- fastnlp/fastNLP Public
  fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.
473 contributions in the last year
Activity overview
Contributed to OpenLMLab/collie, open-nlplab/fastIE, 00INDEX/Uni-app-WebProject, and 11 other repositories.
Contribution activity
April 2023
Created 81 commits in 2 repositories
Created 1 repository
- 00INDEX/collie (Python)
Opened 19 pull requests in 1 repository
OpenLMLab/collie: 18 merged, 1 closed
- Update llama_colossalai.py
- update requirements.txt
- fix save_state_dict
- modified save_state_dict
- fix save_state_dict
- fix save_state_dict
- add speed benchmark for zero
- adjust speed benchmark for colossalai
- add speed benchmark for colossalai api
- Add convert_model: convert models in Collie format to HF or RAW format
- Add Colossal-AI tensor parallel support
- Add more examples for Colossal-AI
- Add more Colossal-AI examples
- Add runnable examples
- add examples
- fix README.md
- add README.md & setup.py
- add README.md & setup.py
- Implement a Colossal-AI pipeline version of LLaMA
Reviewed 1 pull request in 1 repository
OpenLMLab/collie: 1 pull request
Created an issue in HazyResearch/flash-attention that received 5 comments
[Question] What is the difference between FlashAttention and Memory Efficient Attention in xformers?
The paper I recently read, Self-attention Does Not Need O(n²) Memory, uses kernel fusion to overcome large GPU memory usage (to be more specificall…
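For context on the question: the technique from that paper (the basis of xformers' memory-efficient attention) computes exact attention by iterating over key/value chunks with a running softmax, so the full n×n score matrix is never materialized; FlashAttention fuses the same online-softmax idea into a single tiled GPU kernel. Below is a minimal PyTorch sketch of the chunked computation; the function name, shapes, and chunk_size default are illustrative and not taken from either library.

```python
import torch

def chunked_attention(q, k, v, chunk_size=1024):
    # Sketch of memory-efficient (chunked) attention: process key/value
    # chunks while keeping a running max and a running softmax denominator,
    # so only O(n * chunk_size) scores exist at any time instead of O(n^2).
    n, d = q.shape
    scale = d ** -0.5
    m = torch.full((n, 1), float("-inf"), dtype=q.dtype)  # running score max
    l = torch.zeros(n, 1, dtype=q.dtype)                  # running denominator
    acc = torch.zeros_like(q)                             # running value sum
    for start in range(0, k.shape[0], chunk_size):
        kc = k[start:start + chunk_size]     # (c, d) key chunk
        vc = v[start:start + chunk_size]     # (c, d) value chunk
        s = (q @ kc.T) * scale               # (n, c) scores, this chunk only
        m_new = torch.maximum(m, s.max(dim=-1, keepdim=True).values)
        corr = torch.exp(m - m_new)          # rescale old stats to new max
        p = torch.exp(s - m_new)
        l = l * corr + p.sum(dim=-1, keepdim=True)
        acc = acc * corr + p @ vc
        m = m_new
    return acc / l   # equals softmax(q @ k.T / sqrt(d)) @ v, computed exactly
```

The result matches ordinary attention (e.g., for q = k = v = torch.randn(4096, 64), it agrees with torch.softmax(q @ q.T * 64 ** -0.5, -1) @ q up to floating-point tolerance); the difference between the two approaches asked about in the issue is chiefly where the chunking happens, in framework code versus inside one fused CUDA kernel.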

