Collection of Distills using Open R1
asdf
ewre324
AI & ML interests
None yet
Recent Activity
upvoted
an
article
6 days ago
Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial
updated
a model
7 days ago
ewre324/ewre324-R1-Minueza-32M-Distill
updated
a collection
7 days ago
R1 Distill
Organizations
Collections
3
These models have been finetuned to perform reasoning, chain of thought.
-
ewre324/ewre324-Thinker-Llama-3.2-3B-Instruct-Reasoning
Updated • 261 -
ewre324/ewre324-Thinker-Qwen2.5-0.5B-Instruct-Reasoning
Updated • 29 -
ewre324/ewre324-Thinker-SmolLM2-135M-Instruct-Reasoning
Text Generation • Updated • 36 -
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Paper • 2201.11903 • Published • 9
models
8
ewre324/ewre324-R1-Minueza-32M-Distill
Updated
ewre324/ewre324-R1-SmolLM2-135M-Distill
Updated
•
18
ewre324/moondream2
Image-Text-to-Text
•
Updated
•
446
ewre324/ewre324-QwQ-0.5B-Distilled-SFT-Reason
Updated
•
8
ewre324/ewre324-Thinker-Llama-3.2-1B-Instruct-Reason
Updated
•
6
ewre324/ewre324-Thinker-Llama-3.2-3B-Instruct-Reasoning
Updated
•
261
ewre324/ewre324-Thinker-Qwen2.5-0.5B-Instruct-Reasoning
Updated
•
29
ewre324/ewre324-Thinker-SmolLM2-135M-Instruct-Reasoning
Text Generation
•
Updated
•
36