164 72 224

Philipp Schmid

philschmid

https://www.philschmid.de

AI & ML interests

None yet

Recent Activity

updated a dataset about 5 hours ago

philschmid/pdf-samples

published a dataset about 20 hours ago

philschmid/pdf-samples

upvoted a paper 2 days ago

Preference Leakage: A Contamination Problem in LLM-as-a-judge

View all activity

Organizations

philschmid's activity

upvoted a paper 2 days ago

Preference Leakage: A Contamination Problem in LLM-as-a-judge

Paper • 2502.01534 • Published 3 days ago • 34

upvoted an article 4 days ago

Article

Open-R1: Update #1

and 7 others •

5 days ago

• 237

upvoted an article 6 days ago

Article

Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial

•

6 days ago

• 29

upvoted a paper 23 days ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published 24 days ago • 89

upvoted 2 papers about 2 months ago

Large Language Monkeys: Scaling Inference Compute with Repeated Sampling

Paper • 2407.21787 • Published Jul 31, 2024 • 12

Phi-4 Technical Report

Paper • 2412.08905 • Published Dec 12, 2024 • 106

upvoted 2 papers 3 months ago

Hymba: A Hybrid-head Architecture for Small Language Models

Paper • 2411.13676 • Published Nov 20, 2024 • 41

"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization

Paper • 2411.02355 • Published Nov 4, 2024 • 47

upvoted 2 papers 4 months ago

Pyramidal Flow Matching for Efficient Video Generative Modeling

Paper • 2410.05954 • Published Oct 8, 2024 • 39

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Paper • 2409.17146 • Published Sep 25, 2024 • 106

upvoted an article 4 months ago

Article

Llama can now see and run on your device - welcome Llama 3.2

Sep 25, 2024

• 182

upvoted a collection 4 months ago

Llama 3.2

Collection

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 566

upvoted a paper 4 months ago

EuroLLM: Multilingual Language Models for Europe

Paper • 2409.16235 • Published Sep 24, 2024 • 26

upvoted a paper 5 months ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 136

upvoted 3 collections 5 months ago

upvoted an article 5 months ago

Article

Meet Yi-Coder: A Small but Mighty LLM for Code

•

Sep 4, 2024

• 15

upvoted 2 papers 5 months ago

Let Me Speak Freely? A Study on the Impact of Format Restrictions on Performance of Large Language Models

Paper • 2408.02442 • Published Aug 5, 2024 • 21

Generative Verifiers: Reward Modeling as Next-Token Prediction

Paper • 2408.15240 • Published Aug 27, 2024 • 13