Skip to content
@NVIDIA

NVIDIA Corporation

Pinned Loading

  1. cuopt cuopt Public

    GPU accelerated decision optimization

    Cuda 637 104

  2. cuopt-examples cuopt-examples Public

    NVIDIA cuOpt examples for decision optimization

    Jupyter Notebook 391 61

  3. open-gpu-kernel-modules open-gpu-kernel-modules Public

    NVIDIA Linux open GPU kernel module source

    C 16.6k 1.6k

  4. aistore aistore Public

    AIStore: scalable storage for AI applications

    Go 1.7k 231

  5. nvidia-container-toolkit nvidia-container-toolkit Public

    Build and run containers leveraging NVIDIA GPUs

    Go 4k 457

  6. GenerativeAIExamples GenerativeAIExamples Public

    Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

    Jupyter Notebook 3.7k 949

Repositories

Showing 10 of 646 repositories
  • Fuser Public

    A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")

    NVIDIA/Fuser’s past year of commit activity
    C++ 368 73 210 (15 issues need help) 227 Updated Jan 2, 2026
  • cccl Public

    CUDA Core Compute Libraries

    NVIDIA/cccl’s past year of commit activity
  • TensorRT-Incubator Public

    Experimental projects related to TensorRT

    NVIDIA/TensorRT-Incubator’s past year of commit activity
    MLIR 116 22 37 (1 issue needs help) 14 Updated Jan 2, 2026
  • TileGym Public

    Helpful kernel tutorials and examples for tile-based GPU programming

    NVIDIA/TileGym’s past year of commit activity
    Python 526 29 1 5 Updated Jan 3, 2026
  • OSMO Public

    The developer-first platform for scaling complex Physical AI workloads across heterogeneous compute—unifying training GPUs, simulation clusters, and edge devices in a simple YAML

    NVIDIA/OSMO’s past year of commit activity
    Python 61 Apache-2.0 6 23 14 Updated Jan 3, 2026
  • NeMo-Agent-Toolkit Public

    The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.

    NVIDIA/NeMo-Agent-Toolkit’s past year of commit activity
    Python 1,657 Apache-2.0 473 67 30 Updated Jan 2, 2026
  • Model-Optimizer Public

    A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.

    NVIDIA/Model-Optimizer’s past year of commit activity
    Python 1,753 Apache-2.0 228 55 57 Updated Jan 3, 2026
  • Megatron-LM Public

    Ongoing research training transformer models at scale

    NVIDIA/Megatron-LM’s past year of commit activity
    Python 14,772 3,441 332 (1 issue needs help) 256 Updated Jan 3, 2026
  • TensorRT-LLM Public

    TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.

    NVIDIA/TensorRT-LLM’s past year of commit activity
    Python 12,515 1,991 519 473 Updated Jan 2, 2026
  • cuda-quantum Public

    C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows

    NVIDIA/cuda-quantum’s past year of commit activity
    C++ 878 315 406 (16 issues need help) 83 Updated Jan 2, 2026