AK's picture

AK

akhaliq

·

_akhaliq

AI & ML interests

None yet

Recent Activity

commented on a paper about 8 hours ago

Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning

commented on a paper about 8 hours ago

LIMO: Less is More for Reasoning

commented on a paper about 8 hours ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

View all activity

Organizations

akhaliq's activity

commented 3 papers about 8 hours ago

Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning

Paper • 2502.03275 • Published about 22 hours ago • 2 •

LIMO: Less is More for Reasoning

Paper • 2502.03387 • Published about 20 hours ago • 14 •

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 1 day ago • 37 •

commented a paper 1 day ago

ACECODER: Acing Coder RL via Automated Test-Case Synthesis

Paper • 2502.01718 • Published 3 days ago • 21 •

commented 5 papers 2 days ago

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

Paper • 2502.01061 • Published 3 days ago • 149 •

ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning

Paper • 2502.01100 • Published 3 days ago • 12 •

The Jumping Reasoning Curve? Tracking the Evolution of Reasoning Performance in GPT-[n] and o-[n] Models on Multimodal Puzzles

Paper • 2502.01081 • Published 3 days ago • 9 •

Scaling Embedding Layers in Language Models

Paper • 2502.01637 • Published 3 days ago • 16 •

Improving Transformer World Models for Data-Efficient RL

Paper • 2502.01591 • Published 3 days ago • 8 •

commented 4 papers 3 days ago

MatAnyone: Stable Video Matting with Consistent Memory Propagation

Paper • 2501.14677 • Published 13 days ago • 26 •

Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming

Paper • 2501.18837 • Published 7 days ago • 7 •

s1: Simple test-time scaling

Paper • 2501.19393 • Published 6 days ago • 88 •

Trading Inference-Time Compute for Adversarial Robustness

Paper • 2501.18841 • Published 6 days ago • 3 •

commented 2 papers 6 days ago

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published 7 days ago • 49 •

Large Language Models Think Too Fast To Explore Effectively

Paper • 2501.18009 • Published 8 days ago • 22 •

commented 3 papers 7 days ago

Atla Selene Mini: A General Purpose Evaluation Model

Paper • 2501.17195 • Published 10 days ago • 30 •

Early External Safety Testing of OpenAI's o3-mini: Insights from the Pre-Deployment Evaluation

Paper • 2501.17749 • Published 8 days ago • 12 •

TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models

Paper • 2501.16937 • Published 9 days ago • 4 •

commented 2 papers 8 days ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published 9 days ago • 100 •

Optimizing Large Language Model Training Using FP4 Quantization

Paper • 2501.17116 • Published 9 days ago • 32 •