NVIDIA Corporation

Pinned

  1. cuopt Public

    GPU accelerated decision optimization

    Cuda · 550 stars · 91 forks

  2. cuopt-examples Public

    NVIDIA cuOpt examples for decision optimization

    Jupyter Notebook · 379 stars · 56 forks

  3. open-gpu-kernel-modules Public

    NVIDIA Linux open GPU kernel module source

    C · 16.3k stars · 1.5k forks

  4. aistore Public

    AIStore: scalable storage for AI applications

    Go · 1.6k stars · 223 forks

  5. nvidia-container-toolkit Public

    Build and run containers leveraging NVIDIA GPUs

    Go · 3.8k stars · 431 forks

  6. GenerativeAIExamples Public

    Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

    Jupyter Notebook · 3.6k stars · 904 forks

Repositories

Showing 10 of 625 repositories
  • Megatron-LM Public

    Ongoing research training transformer models at scale

    Python · 14,183 stars · 3,270 forks · 314 issues · 200 pull requests · Updated Nov 13, 2025
  • cuda-quantum Public

    C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows

    C++ · 849 stars · 300 forks · 425 issues (17 need help) · 82 pull requests · Updated Nov 13, 2025
  • JAX-Toolbox Public

    JAX-Toolbox

    Python · 359 stars · Apache-2.0 · 66 forks · 80 issues · 35 pull requests · Updated Nov 13, 2025
  • TensorRT-LLM Public

    TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.

    C++ · 12,117 stars · Apache-2.0 · 1,862 forks · 723 issues · 424 pull requests · Updated Nov 13, 2025
  • TensorRT-Model-Optimizer Public

    A unified library of state-of-the-art model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM or TensorRT to optimize inference speed.

    Python · 1,529 stars · Apache-2.0 · 193 forks · 60 issues · 44 pull requests · Updated Nov 13, 2025
  • phosphor-post-code-manager

    C++ · 1 star · Apache-2.0 · 3 forks · 0 issues · 0 pull requests · Updated Nov 13, 2025
  • nsmd Public

    MCTP VDM-based Nvidia System Management API

    C++ · 4 stars · Apache-2.0 · 0 forks · 1 issue · 0 pull requests · Updated Nov 13, 2025
  • cuopt Public

    GPU accelerated decision optimization

    Cuda · 550 stars · Apache-2.0 · 91 forks · 73 issues · 22 pull requests · Updated Nov 13, 2025
  • numbast Public

    Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.

    Python · 52 stars · Apache-2.0 · 16 forks · 30 issues (3 need help) · 13 pull requests · Updated Nov 13, 2025
  • go-ratelimit Public

    High-performance distributed rate limiting library for Go with Redis backend, sliding window algorithm, and dynamic configuration support

    Go · 7 stars · Apache-2.0 · 1 fork · 0 issues · 1 pull request · Updated Nov 13, 2025