Skip to content
Change the repository type filter

All

    Repositories list

    • Website for CSE 234, Winter 2025
      SCSS
      Other
      47701Updated Mar 13, 2025Mar 13, 2025
    • FastVideo

      Public
      FastVideo is a lightweight framework for accelerating large video diffusion models.
      Python
      Apache License 2.0
      731.2k297Updated Mar 13, 2025Mar 13, 2025
    • llmutils

      Public
      LLM Utils
      Python
      0000Updated Mar 13, 2025Mar 13, 2025
    • Python
      382400Updated Mar 13, 2025Mar 13, 2025
    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Apache License 2.0
      6.2k001Updated Mar 9, 2025Mar 9, 2025
    • Dynasor

      Public
      Simple extension on vLLM to help you speed up reasoning model without training.
      Python
      MIT License
      1812861Updated Mar 8, 2025Mar 8, 2025
    • [ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding
      Python
      Apache License 2.0
      741.2k321Updated Mar 6, 2025Mar 6, 2025
    • HTML
      8110Updated Feb 22, 2025Feb 22, 2025
    • [ICML 2024] CLLMs: Consistency Large Language Models
      Python
      Apache License 2.0
      1838570Updated Nov 16, 2024Nov 16, 2024
    • vllm-ltr

      Public
      [NeurIPS 2024] Efficient LLM Scheduling by Learning to Rank
      Python
      Apache License 2.0
      94100Updated Nov 4, 2024Nov 4, 2024
    • MuxServe

      Public
      Jupyter Notebook
      45220Updated Jun 13, 2024Jun 13, 2024
    • dsc291-PA

      Public
      Jupyter Notebook
      3300Updated Jun 6, 2024Jun 6, 2024
    • Website for DSC 291, Spring 2024
      SCSS
      Other
      47000Updated Jun 5, 2024Jun 5, 2024
    • Website for DSC 204a, Winter 2024
      SCSS
      Other
      47801Updated Mar 24, 2024Mar 24, 2024