Michael Feldman

mfeldman143

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 hours ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

upvoted an article 2 days ago

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

liked a model 2 days ago

lerobot/pi0

mfeldman143's activity

upvoted a paper about 2 hours ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 2 days ago • 71

upvoted an article 2 days ago

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

Article • Published 3 days ago • 69

upvoted a collection 8 days ago

Models, Jan 27

12 items • Updated 10 days ago • 1

upvoted a collection 9 days ago

Sapiens

Foundation models for human tasks. Code: https://github.com/facebookresearch/sapiens • 72 items • Updated Sep 18, 2024 • 53

upvoted a collection 11 days ago

Qwen2.5-1M

The long-context version of Qwen2.5, supporting 1M-token context lengths • 2 items • Updated 11 days ago • 97

upvoted a paper 22 days ago

O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning

Paper • 2501.06458 • Published 26 days ago • 29

upvoted 3 papers 27 days ago

PokéLLMon: A Human-Parity Agent for Pokémon Battles with Large Language Models

Paper • 2402.01118 • Published Feb 2, 2024 • 31

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 29 days ago • 253

Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers

Paper • 2408.06195 • Published Aug 12, 2024 • 70

upvoted 2 collections 29 days ago

NeMo Audio Codecs

A series of Neural Audio Codecs • 5 items • Updated 20 days ago • 11

Cosmos

The collection of Cosmos models • 31 items • Updated 20 days ago • 254

upvoted a paper about 2 months ago

Phi-4 Technical Report

Paper • 2412.08905 • Published Dec 12, 2024 • 106

upvoted a collection 2 months ago

Llama 3.3 (All Versions)

Meta's new Llama 3.3 (70B) model in all formats. Includes GGUF, 4-bit bnb and original versions. • 3 items • Updated 2 days ago • 35

upvoted a paper 3 months ago

FastDraft: How to Train Your Draft

Paper • 2411.11055 • Published Nov 17, 2024 • 10

upvoted an article 3 months ago

Low Code Large Language Model Alignment

Article • Published Nov 19, 2024 • 13

upvoted 2 collections 3 months ago

OpenCoder

OpenCoder is an open and reproducible code LLM family which matches the performance of top-tier code LLMs. • 8 items • Updated Nov 23, 2024 • 79

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated about 8 hours ago • 214

upvoted a paper 5 months ago

Sapiens: Foundation for Human Vision Models

Paper • 2408.12569 • Published Aug 22, 2024 • 90