Skip to content
@NVIDIA

NVIDIA Corporation

Pinned Loading

  1. cuopt cuopt Public

    GPU accelerated decision optimization

    Cuda 606 101

  2. cuopt-examples cuopt-examples Public

    NVIDIA cuOpt examples for decision optimization

    Jupyter Notebook 389 62

  3. open-gpu-kernel-modules open-gpu-kernel-modules Public

    NVIDIA Linux open GPU kernel module source

    C 16.4k 1.5k

  4. aistore aistore Public

    AIStore: scalable storage for AI applications

    Go 1.7k 230

  5. nvidia-container-toolkit nvidia-container-toolkit Public

    Build and run containers leveraging NVIDIA GPUs

    Go 3.9k 447

  6. GenerativeAIExamples GenerativeAIExamples Public

    Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

    Jupyter Notebook 3.7k 930

Repositories

Showing 10 of 641 repositories
  • Model-Optimizer Public

    A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.

    NVIDIA/Model-Optimizer’s past year of commit activity
    Python 1,627 Apache-2.0 211 68 54 Updated Dec 9, 2025
  • cuopt Public

    GPU accelerated decision optimization

    NVIDIA/cuopt’s past year of commit activity
    Cuda 606 Apache-2.0 101 75 25 Updated Dec 9, 2025
  • TensorRT-LLM Public

    TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.

    NVIDIA/TensorRT-LLM’s past year of commit activity
    Python 12,343 1,925 590 464 Updated Dec 9, 2025
  • cuda-python Public

    CUDA Python: Performance meets Productivity

    NVIDIA/cuda-python’s past year of commit activity
    Cython 3,077 228 209 13 Updated Dec 9, 2025
  • stdexec Public

    `std::execution`, the proposed C++ framework for asynchronous and parallel programming.

    NVIDIA/stdexec’s past year of commit activity
    C++ 2,125 Apache-2.0 220 112 13 Updated Dec 9, 2025
  • cuda-quantum Public

    C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows

    NVIDIA/cuda-quantum’s past year of commit activity
    C++ 871 309 406 (16 issues need help) 86 Updated Dec 9, 2025
  • numba-cuda Public

    The CUDA target for Numba

    NVIDIA/numba-cuda’s past year of commit activity
    Python 222 BSD-2-Clause 47 98 (1 issue needs help) 25 Updated Dec 9, 2025
  • Megatron-LM Public

    Ongoing research training transformer models at scale

    NVIDIA/Megatron-LM’s past year of commit activity
    Python 14,488 3,356 332 239 Updated Dec 9, 2025
  • cccl Public

    CUDA Core Compute Libraries

    NVIDIA/cccl’s past year of commit activity
  • Fuser Public

    A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")

    NVIDIA/Fuser’s past year of commit activity
    C++ 364 71 208 (15 issues need help) 212 Updated Dec 9, 2025