Skip to content
View shaonianyr's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report shaonianyr

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Python 702 77 Updated Mar 4, 2025

Arena-Hard-Auto: An automatic LLM benchmark.

Python 758 93 Updated Dec 29, 2024

SuperCLUE: 中文通用大模型综合性基准 | A Benchmark for Foundation Models in Chinese

3,114 103 Updated May 23, 2024

Virtual whiteboard for sketching hand-drawn like diagrams

TypeScript 94,271 9,060 Updated Mar 12, 2025

🧡 Follow everything in one place

TypeScript 23,181 972 Updated Mar 13, 2025

Official inference framework for 1-bit LLMs

C++ 12,796 901 Updated Feb 18, 2025

g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains

Python 4,195 377 Updated Jan 27, 2025

An incremental parsing system for programming tools

Rust 19,857 1,673 Updated Mar 12, 2025

Distribute and run LLMs with a single file.

C++ 21,942 1,151 Updated Mar 11, 2025

a QA tool to random generate sql by bnf pattern

Go 76 44 Updated Mar 7, 2023

Generative AI extensions for onnxruntime

C++ 645 156 Updated Mar 12, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 41,284 6,226 Updated Mar 13, 2025

Open-Sora: Democratizing Efficient Video Production for All

Python 23,730 2,345 Updated Mar 13, 2025

Grok open release

Python 50,238 8,366 Updated Aug 30, 2024

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Python 2,007 253 Updated Mar 6, 2025

Interact with your documents using the power of GPT, 100% privately, no data leaks

Python 55,419 7,423 Updated Nov 13, 2024

The Memory layer for AI Agents

Python 26,000 2,451 Updated Mar 12, 2025

text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。

Python 4,643 406 Updated Jan 2, 2025

High-speed Large Language Model Serving for Local Deployment

C++ 8,147 425 Updated Feb 19, 2025

leaked prompts of GPTs

29,386 3,992 Updated Sep 27, 2024

Example models using DeepSpeed

Python 6,357 1,075 Updated Mar 11, 2025

Semantic Evaluation for Text-to-SQL with Distilled Test Suites

Python 260 62 Updated Jun 5, 2024

A collection of GPT system prompts and various prompt injection/leaking knowledge.

HTML 8,660 1,245 Updated Mar 11, 2025

Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)

Python 69,008 8,491 Updated Feb 25, 2025

Crawl a site to generate knowledge files to create your own custom GPT from a URL

TypeScript 21,071 2,247 Updated Jan 23, 2025

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 9,689 1,150 Updated Mar 13, 2025

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Python 2,653 281 Updated Aug 14, 2024

Fast and memory-efficient exact attention

Python 16,256 1,540 Updated Mar 13, 2025

LLM inference in C/C++

C++ 76,406 11,057 Updated Mar 12, 2025
Next
Showing results