ROHITH VENKATA REDDY

knight7561

AI & ML interests

Deep learning, Autonomous Driving

Recent Activity

replied to albertvillanova's post 1 day ago

🚀 Introducing @huggingface Open Deep-Research💥 In just 24 hours, we built an open-source agent that: ✅ Autonomously browse the web ✅ Search, scroll & extract info ✅ Download & manipulate files ✅ Run calculations on data 55% on GAIA validation set! Help us improve it!💡 https://huggingface.co/blog/open-deep-research

upvoted an article 1 day ago

Open-source DeepResearch – Freeing our search agents

reacted to bartowski's post with 👍 15 days ago

Switching to `author_model-name` I posted a poll on twitter, and others have mentioned the interest in me using the convention of including the author name in the model path when I upload. It has a couple advantages, first and foremost of course is ensuring clarity of who uploaded the original model (did Qwen upload Qwen2.6? Or did someone fine tune Qwen2.5 and named it 2.6 for fun?) The second thing is that it avoids collisions, so if multiple people upload the same model and I try to quant them both, I would normally end up colliding and being unable to upload both I'll be implementing the change next week, there are just two final details I'm unsure about: First, should the files also inherit the author's name? Second, what to do in the case that the author name + model name pushes us past the character limit? Haven't yet decided how to handle either case, so feedback is welcome, but also just providing this as a "heads up"

View all activity

Organizations

knight7561's activity

replied to albertvillanova's post 1 day ago

Kudos to you guys, Feeling excited to contribute. Woorth taking a look..! #goHF

upvoted an article 1 day ago

Article

Open-source DeepResearch – Freeing our search agents

3 days ago

• 630

reacted to bartowski's post with 👍 15 days ago

Post

28496

Switching to author_model-name

I posted a poll on twitter, and others have mentioned the interest in me using the convention of including the author name in the model path when I upload.

It has a couple advantages, first and foremost of course is ensuring clarity of who uploaded the original model (did Qwen upload Qwen2.6? Or did someone fine tune Qwen2.5 and named it 2.6 for fun?)

The second thing is that it avoids collisions, so if multiple people upload the same model and I try to quant them both, I would normally end up colliding and being unable to upload both

I'll be implementing the change next week, there are just two final details I'm unsure about:

First, should the files also inherit the author's name?

Second, what to do in the case that the author name + model name pushes us past the character limit?

Haven't yet decided how to handle either case, so feedback is welcome, but also just providing this as a "heads up"

3 replies

reacted to onekq's post with 🔥 15 days ago

Post

2660

This is historical. 🎉

DeepSeek 🐋R1🐋 surpassed OpenAI 🍓o1🍓 on the dual leaderboard. What a year for the open source!

onekq-ai/WebApp1K-models-leaderboard

updated a model 17 days ago

knight7561/SmolLM2_python_coder-FT-ORPO

Text Generation • Updated 17 days ago • 2

published a model 17 days ago

knight7561/SmolLM2_python_coder-FT-ORPO

Text Generation • Updated 17 days ago • 2

updated a model 17 days ago

knight7561/SmolLM2-FT-DPO-python-code

Text Generation • Updated 17 days ago • 2

published a model 17 days ago

knight7561/SmolLM2-FT-DPO-python-code

Text Generation • Updated 17 days ago • 2

liked a Space 29 days ago

521

Open Source Ai Year In Review 2024

😻

What happened in open-source AI this year, and what’s next?

liked a model 30 days ago

nvidia/Cosmos-1.0-Autoregressive-4B

Updated 27 days ago • 2.42k • 46

upvoted a collection 30 days ago

🤖 Agents

Collection

21 items • Updated Dec 31, 2024 • 114

replied to chansung's post about 1 month ago

Thank you for the cool tool..!

updated 3 models about 2 months ago

reacted to cfahlgren1's post with ❤️ 3 months ago

Post

3171

You can clean and format datasets entirely in the browser with a few lines of SQL.

In this post, I replicate the process @mlabonne used to clean the new microsoft/orca-agentinstruct-1M-v1 dataset.

The cleaning process consists of:
- Joining the separate splits together / add split column
- Converting string messages into list of structs
- Removing empty system prompts

https://huggingface.co/blog/cfahlgren1/the-beginners-guide-to-cleaning-a-dataset

Here's his new cleaned dataset: mlabonne/orca-agentinstruct-1M-v1-cleaned