Skip to content
Change the repository type filter

All

    Repositories list

    • Python
      0000Updated Mar 11, 2025Mar 11, 2025
    • A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
      Python
      Apache License 2.0
      378000Updated Mar 11, 2025Mar 11, 2025
    • Shell
      Other
      41740Updated Mar 4, 2025Mar 4, 2025
    • LiteGS

      Public
      A refactored codebase for Gaussian Splatting. Faster(3.5x)!! Modular!! Pure Python or CUDA Extension
      Python
      Other
      25200Updated Mar 4, 2025Mar 4, 2025
    • Go
      Apache License 2.0
      1200Updated Mar 1, 2025Mar 1, 2025
    • 0700Updated Feb 28, 2025Feb 28, 2025
    • MT-DeepEP

      Public
      DeepEP: an efficient expert-parallel communication library
      C++
      Other
      617400Updated Feb 27, 2025Feb 27, 2025
    • A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.
      Python
      Other
      253100Updated Feb 27, 2025Feb 27, 2025
    • C++
      MIT License
      01100Updated Feb 26, 2025Feb 26, 2025
    • mutlass

      Public
      MUSA Templates for Linear Algebra Subroutines
      C++
      Other
      1.2k2410Updated Feb 26, 2025Feb 26, 2025
    • torch_musa is an open source repository based on PyTorch, which can make full use of the super computing power of MooreThreads graphics cards.
      Python
      Other
      29369520Updated Feb 7, 2025Feb 7, 2025
    • MooER

      Public
      MooER: Moore-threads Open Omni model for speech-to-speech intERaction. MooER-omni includes a series of end-to-end speech interaction models along with training and inference code, covering but not limited to end-to-end speech interaction, end-to-end speech translation and speech recognition.
      Python
      Other
      1519520Updated Jan 8, 2025Jan 8, 2025
    • kineto

      Public
      HTML
      Other
      2000Updated Jan 2, 2025Jan 2, 2025
    • TurboSplat-Viz is a 3D Gaussian Splatting (GS) renderer implemented using DirectX 12. Leveraging the exceptional performance of Mesh Shaders, DX12GSViewer achieves unparalleled speed improvements.
      C++
      MIT License
      0300Updated Nov 29, 2024Nov 29, 2024
    • TurboRAG

      Public
      Python
      67260Updated Nov 25, 2024Nov 25, 2024
    • vllm_musa

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Other
      6.2k4220Updated Oct 28, 2024Oct 28, 2024
    • SimuMax

      Public
      a static analytical model for LLM distributed training
      Python
      Other
      01000Updated Oct 18, 2024Oct 18, 2024
    • RetinaGS

      Public
      Python
      Other
      61600Updated Oct 17, 2024Oct 17, 2024
    • Repository for OpenCV's extra modules
      C++
      Other
      5.8k200Updated Sep 25, 2024Sep 25, 2024
    • opencv

      Public
      Open Source Computer Vision Library
      C++
      Other
      56k1800Updated Sep 25, 2024Sep 25, 2024
    • muThrust

      Public
      The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl
      C++
      Other
      756100Updated Sep 14, 2024Sep 14, 2024
    • muAlg

      Public
      Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl
      Cuda
      Other
      450200Updated Sep 13, 2024Sep 13, 2024
    • dynolog

      Public
      Dynolog is a telemetry daemon for performance monitoring and tracing. It exports metrics from different components in the system like the linux kernel, CPU, disks, Intel PT, GPUs etc. Dynolog also integrates with pytorch and can trigger traces for distributed training applications.
      C++
      Other
      01300Updated Aug 7, 2024Aug 7, 2024
    • qtbase

      Public
      Qt Base (Core, Gui, Widgets, Network, ...)
      C++
      1.1k000Updated Jun 20, 2024Jun 20, 2024
    • C++
      Other
      121000Updated Jun 20, 2024Jun 20, 2024
    • Character Animation (AnimateAnyone, Face Reenactment)
      Python
      Apache License 2.0
      2623.3k1017Updated May 31, 2024May 31, 2024
    • Python
      BSD 3-Clause "New" or "Revised" License
      12118110Updated Jan 16, 2024Jan 16, 2024