Skip to content

Pinned repositories

  1. An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.

    Python 5.2k 388

  2. An implementation of model parallel GPT-3-like models on GPUs, based on the DeepSpeed library. Designed to be able to train models in the hundreds of billions of parameters or larger.

    Python 965 88

  3. Open-AI's DALL-E for large scale training in mesh-tensorflow.

    Python 343 33

  4. A framework for few-shot evaluation of autoregressive language models.

    Python 84 35

  5. Python Research Framework

    Python 62 2

Repositories