Xuan-Son Nguyen

ngxson · https://blog.ngxson.com

AI & ML interests

Doing AI for fun, not for profit

Recent Activity

upvoted a paper about 22 hours ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

updated a Space 3 days ago

ngxson/wllama

authored a paper 5 days ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

ngxson's activity

upvoted a paper about 22 hours ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 7 days ago • 153

upvoted an article 9 days ago

Article

Open-R1: Update #1

By … and 7 others • 10 days ago • 268

upvoted a collection 25 days ago

Jan 17 Releases ❄️

Models and datasets of the second week of Jan 2025. • 23 items • Updated 25 days ago • 10

upvoted a collection 27 days ago

OuteTTS 0.3

4 items • Updated 27 days ago • 18

upvoted an article 27 days ago

Article

Run ComfyUI workflows for free on Spaces

Jan 14, 2024 • 51

upvoted a collection 28 days ago

2025 January

33 items • Updated 13 days ago • 12

upvoted a collection about 1 month ago

GGUF LoRA adapters

Adapters extracted from fine tuned models, using mergekit-extract-lora • 16 items • Updated 19 days ago • 3

upvoted 2 collections 3 months ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated 5 days ago • 227

Papers I've read

16 items • Updated about 1 month ago • 6

upvoted an article 3 months ago

Article

Decoding Strategies in Large Language Models

Oct 29, 2024 • 40

upvoted an article 4 months ago

Article

Inference Endpoints Changelog 🚀

Oct 11, 2024 • 20

upvoted a collection 5 months ago

Moshi v0.1 Release

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated Sep 18, 2024 • 227

upvoted an article 5 months ago

Article

"Diffusers Image Fill" guide

Sep 13, 2024 • 45

upvoted 2 articles 6 months ago

Article

Tool Use, Unified

Aug 12, 2024 • 73

Article

The 5 Most Under-Rated Tools on Hugging Face

Aug 22, 2024 • 86

upvoted a paper 6 months ago

To Code, or Not To Code? Exploring Impact of Code in Pre-training

Paper • 2408.10914 • Published Aug 20, 2024 • 42

upvoted 4 articles 6 months ago

Article

WWDC 24: Running Mistral 7B with Core ML

Jul 22, 2024 • 57

Article

Introduction to ggml

Aug 13, 2024 • 142

Article

XetHub is joining Hugging Face!

Aug 8, 2024 • 81

Article

2024 Security Feature Highlights

Aug 6, 2024 • 17