Skip to content
Change the repository type filter

All

    Repositories list

    • FlashInfer: Kernel Library for LLM Serving
      Cuda
      5554k22564Updated Nov 3, 2025Nov 3, 2025
    • Building the Virtuous Cycle for AI-driven LLM Systems
      Python
      127723Updated Nov 2, 2025Nov 2, 2025
    • whl

      Public
      Pre-built wheels for flashinfer python package.
      HTML
      4200Updated Nov 2, 2025Nov 2, 2025
    • Project website of FlashInfer project
      SCSS
      4010Updated Oct 22, 2025Oct 22, 2025
    • cubloaty

      Public
      a size profiler for cuda binary
      Python
      05200Updated Oct 7, 2025Oct 7, 2025
    • web-data

      Public
      0000Updated Jun 25, 2025Jun 25, 2025
    • Python
      36400Updated Apr 26, 2025Apr 26, 2025
    • Simple python library for generating your own perfetto traces for your application. Can be used for both app instrumentation and custom trace generation (for your own purposes)
      Python
      6100Updated Apr 16, 2025Apr 16, 2025
    • flashinfer-nightly

      Public archive
      FlashInfer Nightly
      1600Updated Apr 9, 2025Apr 9, 2025
    • 0400Updated Apr 2, 2025Apr 2, 2025
    • Jupyter Notebook
      0200Updated Jan 10, 2025Jan 10, 2025
    • Debug print operator for cudagraph debugging
      Cuda
      21411Updated Aug 2, 2024Aug 2, 2024
    • The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
      15k000Updated Apr 21, 2024Apr 21, 2024
    • candle

      Public
      Minimalist ML framework for Rust
      Rust
      1.3k000Updated Mar 7, 2024Mar 7, 2024