@FMInference

Foundation Model Inference

Inference Systems for Foundation Models

Pinned

  1. FlexGen Public

    Running large language models like OPT-175B/GPT-3 on a single GPU. Up to 100x faster than other offloading systems.

    Python · 1.5k stars · 64 forks

Repositories

  • FlexGen Public

    Python · 1,492 stars · Apache-2.0 · 64 forks · 8 open issues · 1 open pull request · Updated Feb 21, 2023

Top languages

Python
