Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.
-
Updated
Mar 13, 2025 - Python
Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.
Simple extension on vLLM to help you speed up reasoning model without training.
Agentic Deep Graph Reasoning Implementation
Lightweight replication study of DeepSeek-R1-Zero. Interesting findings include "No Aha Moment", "Longer CoT ≠ Accuracy", and "Language Mixing in Instruct Models".
Pure RL to post-train base models for social reasoning capabilities. Lightweight replication of DeepSeek-R1-Zero with Social IQa dataset.
Add a description, image, and links to the reasoning-models topic page so that developers can more easily learn about it.
To associate your repository with the reasoning-models topic, visit your repo's landing page and select "manage topics."