shaonianyr

🎯

Focusing

少年 shaonianyr

QQ Group: 552643038

75 followers · 3 following

Achievements

Stars

llm-as-a-judge / Awesome-LLM-as-a-judge

254 8 Updated Mar 3, 2025

sunnynexus / Search-o1

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Python 702 77 Updated Mar 4, 2025

lmarena / arena-hard-auto

Arena-Hard-Auto: An automatic LLM benchmark.

Python 758 93 Updated Dec 29, 2024

CLUEbenchmark / SuperCLUE

SuperCLUE: A comprehensive benchmark for general-purpose foundation models in Chinese

3,114 103 Updated May 23, 2024

excalidraw / excalidraw

Virtual whiteboard for sketching hand-drawn like diagrams

TypeScript 94,271 9,060 Updated Mar 12, 2025

RSSNext / Follow

🧡 Follow everything in one place

TypeScript 23,181 972 Updated Mar 13, 2025

microsoft / BitNet

Official inference framework for 1-bit LLMs

C++ 12,796 901 Updated Feb 18, 2025

bklieger-groq / g1

g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains

Python 4,195 377 Updated Jan 27, 2025

tree-sitter / tree-sitter

An incremental parsing system for programming tools

Rust 19,857 1,673 Updated Mar 12, 2025

Mozilla-Ocho / llamafile

Distribute and run LLMs with a single file.

C++ 21,942 1,151 Updated Mar 11, 2025

pingcap / go-randgen

A QA tool that randomly generates SQL from BNF patterns

Go 76 44 Updated Mar 7, 2023

microsoft / onnxruntime-genai

Generative AI extensions for onnxruntime

C++ 645 156 Updated Mar 12, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 41,284 6,226 Updated Mar 13, 2025

hpcaitech / Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Python 23,730 2,345 Updated Mar 13, 2025

xai-org / grok-1

Grok open release

Python 50,238 8,366 Updated Aug 30, 2024

casper-hansen / AutoAWQ

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.

Python 2,007 253 Updated Mar 6, 2025

zylon-ai / private-gpt

Interact with your documents using the power of GPT, 100% privately, no data leaks

Python 55,419 7,423 Updated Nov 13, 2024

mem0ai / mem0

The Memory layer for AI Agents

Python 26,000 2,451 Updated Mar 12, 2025

shibing624 / text2vec

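text2vec, text to vector. A text vector representation tool that converts text into vector matrices. It implements text representation and text similarity models including Word2Vec, RankBM25, Sentence-BERT, and CoSENT, ready to use out of the box.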

Python 4,643 406 Updated Jan 2, 2025

SJTU-IPADS / PowerInfer

High-speed Large Language Model Serving for Local Deployment

C++ 8,147 425 Updated Feb 19, 2025

linexjlin / GPTs

leaked prompts of GPTs

29,386 3,992 Updated Sep 27, 2024

deepspeedai / DeepSpeedExamples

Example models using DeepSpeed

Python 6,357 1,075 Updated Mar 11, 2025

taoyds / test-suite-sql-eval

Semantic Evaluation for Text-to-SQL with Distilled Test Suites

Python 260 62 Updated Jun 5, 2024

LouisShark / chatgpt_system_prompt

A collection of GPT system prompts and various prompt injection/leaking knowledge.

HTML 8,660 1,245 Updated Mar 11, 2025

abi / screenshot-to-code

Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)

Python 69,008 8,491 Updated Feb 25, 2025

BuilderIO / gpt-crawler

Crawl a site to generate knowledge files to create your own custom GPT from a URL

TypeScript 21,071 2,247 Updated Jan 23, 2025

NVIDIA / TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 9,689 1,150 Updated Mar 13, 2025

dvlab-research / LongLoRA

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Python 2,653 281 Updated Aug 14, 2024

Dao-AILab / flash-attention

Fast and memory-efficient exact attention

Python 16,256 1,540 Updated Mar 13, 2025

ggml-org / llama.cpp

LLM inference in C/C++

C++ 76,406 11,057 Updated Mar 12, 2025