Qian Liu's picture

Qian Liu

SivilTaram

·

http://siviltaram.github.io/

AI & ML interests

Cooking cool things

Recent Activity

upvoted a paper about 2 hours ago

Demystifying Long Chain-of-Thought Reasoning in LLMs

upvoted a paper about 2 hours ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

updated a model about 8 hours ago

SivilTaram/tongyao_models

View all activity

Organizations

SivilTaram's activity

upvoted 2 papers about 2 hours ago

Demystifying Long Chain-of-Thought Reasoning in LLMs

Paper • 2502.03373 • Published about 23 hours ago • 18

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 2 days ago • 63

upvoted a paper 3 days ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published 6 days ago • 88

upvoted 2 collections 10 days ago

Qwen2.5-1M

The long-context version of Qwen2.5, supporting 1M-token context lengths • 2 items • Updated 11 days ago • 97

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 3 items • Updated 11 days ago • 322

upvoted 2 papers 14 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 15 days ago • 301

Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments

Paper • 2501.10893 • Published 19 days ago • 23

upvoted 2 papers 29 days ago

OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework

Paper • 2405.11143 • Published May 20, 2024 • 36

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published Jan 4 • 90

upvoted 3 papers about 1 month ago

Long-context LLMs Struggle with Long In-context Learning

Paper • 2404.02060 • Published Apr 2, 2024 • 36

OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis

Paper • 2412.19723 • Published Dec 27, 2024 • 82

Diving into Self-Evolving Training for Multimodal Reasoning

Paper • 2412.17451 • Published Dec 23, 2024 • 43

upvoted 2 papers about 2 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 345

MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale

Paper • 2412.05237 • Published Dec 6, 2024 • 47

upvoted 4 collections 2 months ago

Sailor2 Models

9 items • Updated about 1 month ago • 4

Sailor2 Post-training Datasets

3 items • Updated Dec 3, 2024 • 5

Sailor2 Pre-training Datasets

8 items • Updated Dec 4, 2024 • 4

🔱 Sailor2 Language Models

Sailing in South-East Asia with Inclusive Multilingual LLMs • 9 items • Updated Dec 3, 2024 • 22

upvoted a paper 2 months ago

Yi-Lightning Technical Report

Paper • 2412.01253 • Published Dec 2, 2024 • 27

upvoted a paper 3 months ago

When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context Training

Paper • 2411.13476 • Published Nov 20, 2024 • 15