Usama Khatab
usamakhatab980
0 followers · 6 following
AI & ML interests
None yet
Recent Activity
upvoted a paper 2 days ago
Search-o1: Agentic Search-Enhanced Large Reasoning Models
reacted to mkurman's post with 👍 6 days ago
Blurred-Thoughts Supervised Fine-Tuning (BT-SFT) 🤖

Can we teach a model to think completely on its own without reinforcement learning? Actually, yes. We can do straightforward supervised fine-tuning using a relatively simple trick: blurring part of the CoT thoughts.

But why is this effective? We observed that models differ in their thinking processes, and fine-tuning one model on another model's thoughts (CoT) can be inefficient, often resulting in the model simply memorizing the reasoning rather than learning how to actually think. I discovered that this process can still be efficient if we clearly indicate when the model should start and stop thinking, uncover only part of the CoT together with the expected answer, and blur the rest of the CoT. This lets the model learn only a portion of the thought process while still arriving at the expected answer.

To demonstrate this, you can look at my experimental BT-SFT on the meditsolutions/Llama-3.2-SUN-2.5B-chat model, which was fine-tuned on 151 million tokens from the Magpie-Align/Magpie-Reasoning-V2-250K-CoT-Deepseek-R1-Llama-70B dataset. Enjoy! 🚀

PS. If you were curious enough to read this, leave me a comment. It's always nice to chat with open-minded and intelligent people.
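
The post does not say how the "blurring" is implemented. One possible reading, offered here only as an assumption, is loss masking: a random subset of CoT token positions is excluded from the training loss, while the start/stop thinking markers and the answer stay visible. The sketch below illustrates that reading in plain Python. The function name `blur_cot_labels`, the `keep_ratio` parameter, and the random selection of positions are all hypothetical; only the value -100 is a known convention (the default `ignore_index` of PyTorch's `CrossEntropyLoss`).

```python
import random

# -100 is the default ignore_index of PyTorch's CrossEntropyLoss;
# positions carrying this label contribute nothing to the loss.
IGNORE_INDEX = -100


def blur_cot_labels(token_ids, cot_start, cot_end, keep_ratio=0.5, seed=0):
    """Return training labels with part of the CoT span masked out.

    ASSUMPTION: "blurring" is modelled as loss masking. The post does not
    confirm this; it is one plausible interpretation.

    token_ids  -- full sequence (prompt, thinking markers, CoT, answer)
    cot_start  -- index of the first CoT token (after the start marker)
    cot_end    -- index one past the last CoT token (before the stop marker)
    keep_ratio -- hypothetical fraction of CoT tokens left "uncovered"
    """
    rng = random.Random(seed)
    labels = list(token_ids)
    cot_positions = list(range(cot_start, cot_end))
    # Number of CoT positions to hide from the loss.
    n_blur = len(cot_positions) - int(len(cot_positions) * keep_ratio)
    for i in rng.sample(cot_positions, n_blur):
        labels[i] = IGNORE_INDEX
    return labels


# Toy sequence: 5 prompt tokens, 10 CoT tokens, 5 answer tokens.
tokens = list(range(100, 120))
labels = blur_cot_labels(tokens, cot_start=5, cot_end=15, keep_ratio=0.5)
```

With this labelling, tokens outside the CoT span keep their original labels, so the markers and the expected answer are always trained on, while only half of the CoT positions contribute to the loss.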
liked a Space 6 days ago
Qwen/Qwen2.5-Max-Demo
Organizations
None yet
usamakhatab980's activity
liked a Space 6 days ago
Qwen2.5 Max Demo 🐢 • Running • 446
Send messages for chatbot responses
liked a model 6 days ago
deepseek-ai/Janus-Pro-7B
Any-to-Any • Updated 6 days ago • 223k • 2.67k
liked a model 3 months ago
microsoft/OmniParser
Image-Text-to-Text • Updated Dec 2, 2024 • 1.99k • 1.55k