Nishith Jain's picture

Nishith Jain

KingNish

·

AI & ML interests

AI is fun actually. Busy till June 2025.

Recent Activity

liked a dataset 35 minutes ago

simplescaling/s1K

liked a model about 11 hours ago

Bingsu/adetailer

liked a Space 1 day ago

huggingface/number-tokenization-blog

View all activity

Organizations

KingNish's activity

upvoted a paper 1 day ago

Assisting in Writing Wikipedia-like Articles From Scratch with Large Language Models

Paper • 2402.14207 • Published Feb 22, 2024 • 6

upvoted an article 2 days ago

Article

Open-source DeepResearch – Freeing our search agents

3 days ago

• 648

upvoted a paper 3 days ago

Scalable-Softmax Is Superior for Attention

Paper • 2501.19399 • Published 6 days ago • 18

upvoted an article 3 days ago

Article

🚀 Deploying OLMo-7B with Text Generation Inference (TGI) on Hugging Face Spaces

By

•

4 days ago

• 5

upvoted a paper 4 days ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published 6 days ago • 88

upvoted an article 4 days ago

Article

They Said It Couldn’t Be Done

By

and 2 others •

Dec 5, 2024

• 80

upvoted 2 collections 4 days ago

LLM Reasoning Papers

Papers to improve reasoning capabilities of LLMs • 20 items • Updated 22 days ago • 113

Reasoning Datasets

Distilled synthetic Reasoning datasets • 7 items • Updated 4 days ago • 45

upvoted a paper 4 days ago

SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer

Paper • 2501.18427 • Published 7 days ago • 16

upvoted a collection 4 days ago

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 6 items • Updated 6 days ago • 34

upvoted a paper 4 days ago

Large Language Models Think Too Fast To Explore Effectively

Paper • 2501.18009 • Published 8 days ago • 22

upvoted an article 4 days ago

Article

Open-R1: Update #1

By

and 7 others •

5 days ago

• 237

upvoted an article 5 days ago

Article

The AHA Indicator

By

•

5 days ago

• 3

upvoted a collection 5 days ago

UpScale / Enhancers

9 items • Updated 5 days ago • 9

upvoted an article 6 days ago

Article

Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial

By

•

6 days ago

• 29

upvoted an article 7 days ago

Article

Faster Text Generation with Self-Speculative Decoding

Nov 20, 2024

• 51

upvoted 2 articles 8 days ago

Article

PEFT: Parameter-Efficient Fine-Tuning Methods for LLMs

By

•

13 days ago

• 12

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

By

•

8 days ago

• 23

upvoted an article 9 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

10 days ago

• 648

upvoted an article 16 days ago

Article

The SOTA Text-to-speech and Zero Shot Voice cloning model that no one knows about...

By

•

17 days ago

• 56