pagezyhf (Simon Pagezy)

updated a dataset about 7 hours ago

huggingface/documentation-images

Viewer • Updated 15 minutes ago • 50 • 3.87M • 47

liked a Space 5 days ago

329

NeuralJam

🚂

EscapeExpress : LLM AI detective puzzle game.

upvoted an article 6 days ago

Article

Open-source DeepResearch – Freeing our search agents

8 days ago

• 908

posted an update 12 days ago

Post

1637

We published https://huggingface.co/blog/deepseek-r1-aws!

If you are using AWS, give a read. It is a running document to showcase how to deploy and fine-tune DeepSeek R1 models with Hugging Face on AWS.

We're working hard to enable all the scenarios, whether you want to deploy to Inference Endpoints, Sagemaker or EC2; with GPUs or with Trainium & Inferentia.

We have full support for the distilled models, DeepSeek-R1 support is coming soon!! I'll keep you posted.

Cheers

1 reply

·

updated a dataset 12 days ago

amazon-sagemaker/repository-metadata

Preview • Updated 5 days ago • 305 • 1

published an article 13 days ago

Article

How to deploy and fine-tune DeepSeek models on AWS

13 days ago

• 38

reacted to m-ric's post with 🚀 13 days ago

Post

3893

𝗧𝗵𝗲 𝗛𝘂𝗯 𝘄𝗲𝗹𝗰𝗼𝗺𝗲𝘀 𝗲𝘅𝘁𝗲𝗿𝗻𝗮𝗹 𝗶𝗻𝗳𝗲𝗿𝗲𝗻𝗰𝗲 𝗽𝗿𝗼𝘃𝗶𝗱𝗲𝗿𝘀!

✅ Hosting our own inference was not enough: now the Hub 4 new inference providers: fal, Replicate, SambaNova Systems, & Together AI.

Check model cards on the Hub: you can now, in 1 click, use inference from various providers (cf video demo)

Their inference can also be used through our Inference API client. There, you can use either your custom provider key, or your HF token, then billing will be handled directly on your HF account, as a way to centralize all expenses.

💸 Also, PRO users get 2$ inference credits per month!

Read more in the announcement 👉 https://huggingface.co/blog/inference-providers

1 reply

·

New activity in deepseek-ai/DeepSeek-R1 13 days ago

problem with using serverless inference

1

#78 opened 13 days ago by

manju2345

New activity in amazon-sagemaker/repository-metadata 13 days ago

Update modal.json

#29 opened 13 days ago by

pagezyhf

upvoted an article 14 days ago

Article

Welcome to Inference Providers on the Hub 🔥

15 days ago

• 319

New activity in deepseek-ai/DeepSeek-R1-Distill-Llama-70B 14 days ago

Amazon Sagemaker deployment failing with CUDA OutOfMemory error

3

#10 opened 15 days ago by

neelkapadia

New activity in Qwen/Qwen2-VL-7B-Instruct 15 days ago

Anyone able to deploy an inference endpoint on sagemaker?

6

#71 opened about 1 month ago by

TeoGX

reacted to merve's post with 👍 15 days ago

Post

5090

Oof, what a week! 🥵 So many things have happened, let's recap! merve/jan-24-releases-6793d610774073328eac67a9

Multimodal 💬
- We have released SmolVLM -- tiniest VLMs that come in 256M and 500M, with it's retrieval models ColSmol for multimodal RAG 💗
- UI-TARS are new models by ByteDance to unlock agentic GUI control 🤯 in 2B, 7B and 72B
- Alibaba DAMO lab released VideoLlama3, new video LMs that come in 2B and 7B
- MiniMaxAI released Minimax-VL-01, where decoder is based on MiniMax-Text-01 456B MoE model with long context
- Dataset: Yale released a new benchmark called MMVU
- Dataset: CAIS released Humanity's Last Exam (HLE) a new challenging MM benchmark

LLMs 📖
- DeepSeek-R1 & DeepSeek-R1-Zero: gigantic 660B reasoning models by DeepSeek, and six distilled dense models, on par with o1 with MIT license! 🤯
- Qwen2.5-Math-PRM: new math models by Qwen in 7B and 72B
- NVIDIA released AceMath and AceInstruct, new family of models and their datasets (SFT and reward ones too!)

Audio 🗣️
- Llasa is a new speech synthesis model based on Llama that comes in 1B,3B, and 8B
- TangoFlux is a new audio generation model trained from scratch and aligned with CRPO

Image/Video/3D Generation ⏯️
- Flex.1-alpha is a new 8B pre-trained diffusion model by ostris similar to Flux
- tencent released Hunyuan3D-2, new 3D asset generation from images

7 replies

·

upvoted an article 19 days ago

Article

Mastering Long Contexts in LLMs with KVPress

By

and 1 other •

19 days ago

• 62

liked a model 19 days ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated 2 days ago • 2.94M • • 8.29k

liked 2 models 20 days ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-32B

Text Generation • Updated 2 days ago • 565k • • 994

deepseek-ai/DeepSeek-R1-Zero

Text Generation • Updated 2 days ago • 28.1k • 773

upvoted a collection 20 days ago

DeepSeek-R1

Collection

8 items • Updated 22 days ago • 472

reacted to burtenshaw's post with 🔥 23 days ago

Post

43532

We’re launching a FREE and CERTIFIED course on Agents!

We're thrilled to announce the launch of the Hugging Face Agents course on Learn! This interactive, certified course will guide you through building and deploying your own AI agents.

Here's what you'll learn:

- Understanding Agents: We'll break down the fundamentals of AI agents, showing you how they use LLMs to perceive their environment (observations), reason about it (thoughts), and take actions. Think of a smart assistant that can book appointments, answer emails, or even write code based on your instructions.
- Building with Frameworks: You'll dive into popular agent frameworks like LangChain, LlamaIndex and smolagents. These tools provide the building blocks for creating complex agent behaviors.
- Real-World Applications: See how agents are used in practice, from automating SQL queries to generating code and summarizing complex documents.
- Certification: Earn a certification by completing the course modules, implementing a use case, and passing a benchmark assessment. This proves your skills in building and deploying AI agents.
Audience

This course is designed for anyone interested in the future of AI. Whether you're a developer, data scientist, or simply curious about AI, this course will equip you with the knowledge and skills to build your own intelligent agents.

Enroll today and start building the next generation of AI agent applications!

https://bit.ly/hf-learn-agents

28 replies

·

reacted to mlabonne's post with 🤗 23 days ago

Post

4366

🆕 LLM Course 2025 edition!

I updated the LLM Scientist roadmap and added a ton of new information and references. It covers training, datasets, evaluation, quantization, and new trends like test-time compute scaling.

The LLM Course has been incredibly popular (41.3k stars!) and I've been touched to receive many, many messages about how it helped people in their careers.

I know how difficult this stuff can be, so I'm super proud of the impact it had. I want to keep updating it in 2025, especially with the LLM Engineer roadmap.

Thanks everyone, hope you'll enjoy it!

💻 LLM Course: https://huggingface.co/blog/mlabonne/llm-course

Simon Pagezy

AI & ML interests

Recent Activity

Organizations

pagezyhf's activity

huggingface/documentation-images

NeuralJam

Open-source DeepResearch – Freeing our search agents

amazon-sagemaker/repository-metadata

How to deploy and fine-tune DeepSeek models on AWS

problem with using serverless inference

Update modal.json

Welcome to Inference Providers on the Hub 🔥

Amazon Sagemaker deployment failing with CUDA OutOfMemory error

Anyone able to deploy an inference endpoint on sagemaker?

Mastering Long Contexts in LLMs with KVPress

deepseek-ai/DeepSeek-R1

deepseek-ai/DeepSeek-R1-Distill-Qwen-32B

deepseek-ai/DeepSeek-R1-Zero

DeepSeek-R1