Pinned repositories
-
NeMo
Neural Modules: a toolkit for conversational AI
-
nvtx-plugins
Python bindings for NVTX
-
aistore
AIStore: scalable storage for AI applications
-
tensorrt-inference-server
The TensorRT Inference Server provides a cloud inferencing solution optimized for NVIDIA GPUs.
-
ai-assisted-annotation-client
Client-side integration example source code and libraries for the AI-Assisted Annotation SDK
-
deepops
Tools for building GPU clusters
-
DALI
A library containing both highly optimized building blocks and an execution engine for data pre-processing in deep learning applications
-
gpu-operator
NVIDIA GPU Operator creates/configures/manages GPUs atop Kubernetes
-
Q2RTX
NVIDIA’s implementation of RTX ray-tracing in Quake II
-
apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch
-
container-wiki
Documentation for the NVIDIA cloud native technologies
-
jitify
A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).
-
OptiX_Apps
Advanced Samples for the NVIDIA OptiX 7 Ray Tracing SDK
-
nvidia-settings
NVIDIA driver control panel
-
data-science-stack
NVIDIA Data Science stack tools
-
pyxis
Container plugin for Slurm Workload Manager
-
DeepLearningExamples
Deep Learning Examples
-
gpu-monitoring-tools
Tools for monitoring NVIDIA GPUs on Linux
-
enroot
A simple yet powerful tool to turn traditional container/OS images into unprivileged sandboxes.
-
TensorRT
TensorRT is a C++ library for high performance inference on NVIDIA GPUs and deep learning accelerators.
-
libnvidia-container
NVIDIA container runtime library
-
tensorflow-determinism
Tracking, debugging, and patching non-determinism in TensorFlow
-
flownet2-pytorch
PyTorch implementation of FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks
-
nccl
Optimized primitives for collective multi-GPU communication
-
retinanet-examples
Fast and accurate object detection with end-to-end GPU optimization
-
UnsupervisedLandmarkLearning
Implementation for the unsupervised latent landmark learning work from NVIDIA Applied Deep Learning Research
-
tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
-
ipyparaview
IPython widget for server-side ParaView rendering in Jupyter