6 140 683

Abdullah Abdelrhim

abdullah

abodacs

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

Improving Transformer World Models for Data-Efficient RL

upvoted an article 4 days ago

Open-R1: Update #1

upvoted a collection 4 days ago

Reasoning Datasets

View all activity

Organizations

abdullah's activity

upvoted a paper 2 days ago

Improving Transformer World Models for Data-Efficient RL

Paper • 2502.01591 • Published 3 days ago • 8

upvoted an article 4 days ago

Article

Open-R1: Update #1

and 7 others •

5 days ago

• 237

upvoted a collection 4 days ago

Reasoning Datasets

Collection

Distilled synthetic Reasoning datasets • 7 items • Updated 4 days ago • 45

upvoted an article 14 days ago

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

15 days ago

• 119

upvoted a paper 14 days ago

Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments

Paper • 2501.10893 • Published 19 days ago • 23

upvoted a paper 28 days ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 29 days ago • 253

upvoted a paper 29 days ago

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published Jan 4 • 90

upvoted a paper about 1 month ago

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5, 2024 • 88

upvoted an article about 1 month ago

Article

Fine-tune ModernBERT for text classification using synthetic data

•

Dec 30, 2024

• 31

upvoted 2 papers about 1 month ago

Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search

Paper • 2412.18319 • Published Dec 24, 2024 • 37

LearnLM: Improving Gemini for Learning

Paper • 2412.16429 • Published Dec 21, 2024 • 22

upvoted a collection about 2 months ago

DeTikZify

Collection

Synthesizing Graphics Programs for Scientific Figures and Sketches with TikZ • 11 items • Updated Dec 4, 2024 • 7

upvoted a paper about 2 months ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 79

upvoted an article about 2 months ago

Article

Rethinking Backpropagation: Thoughts on What's Wrong with Backpropagation

•

Dec 2, 2024

• 5

upvoted a paper about 2 months ago

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 129

upvoted an article 2 months ago

Article

🐺🐦‍⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs

•

Dec 4, 2024

• 76

upvoted a collection 2 months ago

LLaMA-O1-1129 Datasets, Models, Codes and Papers

Collection

8 items • Updated Dec 3, 2024 • 18

upvoted 2 papers 3 months ago

Cut Your Losses in Large-Vocabulary Language Models

Paper • 2411.09009 • Published Nov 13, 2024 • 45

Thinking LLMs: General Instruction Following with Thought Generation

Paper • 2410.10630 • Published Oct 14, 2024 • 18

upvoted an article 3 months ago

Article

Releasing the largest multilingual open pretraining dataset

and 2 others •

Nov 13, 2024

• 98