2025 January - a zh-ai-community Collection

2025 January

updated 8 days ago

deepseek-ai/Janus-Pro-7B

Any-to-Any • Updated 5 days ago • 223k • 2.66k
deepseek-ai/Janus-Pro-1B

Any-to-Any • Updated 5 days ago • 64.9k • 336
tencent/Hunyuan3D-2

Image-to-3D • Updated 3 days ago • 41k • 799
tencent/Hunyuan-7B-Instruct

Text Generation • Updated 13 days ago • 327 • 39
ByteDance/Sa2VA-4B

Image-Text-to-Text • Updated 23 days ago • 4.85k • 62

Note A unified model for dense grounded understanding of images & videos.
bytedance-research/UI-TARS-72B-DPO

Image-Text-to-Text • Updated 12 days ago • 9.44k • 76
deepseek-ai/DeepSeek-R1

Text Generation • Updated 6 days ago • 1.54M • 7.26k

Note 660B reasoning model with MIT license
deepseek-ai/DeepSeek-R1-Zero

Text Generation • Updated 6 days ago • 25.7k • 724
MiniMaxAI/MiniMax-VL-01

Image-Text-to-Text • Updated 12 days ago • 2.14k • 229

Note A non-transformer-based (ViT-MLP-LLM framework) VLM
MiniMaxAI/MiniMax-Text-01

Text Generation • Updated 21 days ago • 6.48k • 501

Note 456B LLM with a 1M-token training context
Qwen/Qwen2.5-Math-PRM-7B

Text Classification • Updated 20 days ago • 13.6k • 49

Note Math model
Qwen/Qwen2.5-14B-Instruct-1M

Text Generation • Updated 8 days ago • 13.3k • 220
openbmb/MiniCPM-o-2_6

Any-to-Any • Updated 11 days ago • 316k • 922

Note End-side multimodal LLM that supports real-time conversation and video understanding.
ICTNLP/llava-mini-llama-3.1-8b

Image-Text-to-Text • Updated 25 days ago • 7.19k • 42
BlinkDL/rwkv-7-world

Text Generation • Updated 6 days ago • 61

Note RNN + Transformers
HKUSTAudio/Llasa-3B

Text-to-Speech • Updated 4 days ago • 7.08k • 415

Note TTS
DAMO-NLP-SG/VideoLLaMA3-7B

Visual Question Answering • Updated 7 days ago • 4.43k • 30
internlm/internlm3-8b-instruct

Text Generation • Updated 21 days ago • 34.8k • 191
baichuan-inc/Baichuan-M1-14B-Base

Updated 12 days ago • 198 • 18

Note Medical LLM
opencsg/Fineweb-Edu-Chinese-V2.1

Preview • Updated 20 days ago • 8.87k • 9

Note Dataset designed specifically for natural language processing (NLP) tasks in the education sector.
DAMO-NLP-SG/multimodal_textbook

Updated 26 days ago • 15.2k • 132

Note A multimodal dataset for vision-language pretraining; includes 6.5M images + 0.8B text from 22k hours of instructional videos
hithink-ai/MME-Finance

Viewer • Updated 16 days ago • 402 • 81 • 7
KwaiVGI/GameFactory-Dataset

Updated 23 days ago • 179 • 9
m-a-p/YuE-s1-7B-anneal-zh-cot

Text Generation • Updated 7 days ago • 682 • 25
m-a-p/YuE-s1-7B-anneal-jp-kr-cot

Text Generation • Updated 7 days ago • 1k • 14
m-a-p/YuE-s1-7B-anneal-en-cot

Text Generation • Updated 7 days ago • 25.4k • 346
Qwen/Qwen2.5-VL-3B-Instruct

Image-Text-to-Text • Updated 2 days ago • 97.6k • 160
Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • Updated about 13 hours ago • 307k • 327
Running on Zero

1.31k

Hunyuan3D-2.0

🌍

Text-to-3D and Image-to-3D Generation
Running

34

UI-TARS

🌖

Select coordinates on an image based on instructions
Running

48

MiniMaxVL01

💬

Generate responses using text and images
Running on Zero

1.47k

Chat With Janus-Pro-7B

🌍

A unified multimodal understanding and generation model.
Running

443

Qwen2.5 Max Demo

🐢

Send messages for chatbot responses