Optimizing Large Language Model Training Using FP4 Quantization Paper • 2501.17116 • Published 9 days ago • 32
Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models Paper • 2501.13629 • Published 14 days ago • 42
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published 29 days ago • 253
SCBench: A KV Cache-Centric Analysis of Long-Context Methods Paper • 2412.10319 • Published Dec 13, 2024 • 9
Multimodal Latent Language Modeling with Next-Token Diffusion Paper • 2412.08635 • Published Dec 11, 2024 • 44
Fine-tuning LLMs to 1.58bit: extreme quantization made easy Article • Published Sep 18, 2024 • 216
RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval Paper • 2409.10516 • Published Sep 16, 2024 • 41