Raúl Garrido's picture

103 526

Raúl Garrido

happybydefault

·

https://happybydefault.com

AI & ML interests

None yet

Recent Activity

upvoted an article 1 day ago

Open-source DeepResearch – Freeing our search agents

liked a Space 2 days ago

PramaLLC/BEN2

upvoted a paper 3 days ago

MatAnyone: Stable Video Matting with Consistent Memory Propagation

View all activity

Organizations

happybydefault's activity

upvoted an article 1 day ago

Article

Open-source DeepResearch – Freeing our search agents

3 days ago

• 640

upvoted a paper 3 days ago

MatAnyone: Stable Video Matting with Consistent Memory Propagation

Paper • 2501.14677 • Published 13 days ago • 26

upvoted a paper 6 days ago

Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch

Paper • 2501.18512 • Published 7 days ago • 25

upvoted a collection 7 days ago

Tulu 3 Models

All models released with Tulu 3 -- state of the art open post-training recipes. • 10 items • Updated 8 days ago • 86

upvoted a collection 10 days ago

POTION

These are the flagship POTION models. Load them and use them with model2vec (https://github.com/MinishLab/model2vec) or sentence-transformers • 5 items • Updated 3 days ago • 10

upvoted 2 collections 11 days ago

Qwen2.5-1M

The long-context version of Qwen2.5, supporting 1M-token context lengths • 2 items • Updated 11 days ago • 97

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 3 items • Updated 11 days ago • 322

upvoted a collection 12 days ago

DeepSeek-R1-ReDistill

Re-distilled DeepSeek R1 models • 4 items • Updated 7 days ago • 10

upvoted 2 collections 15 days ago

GTE ModernBERT

GTE Models Based on ModernBERT • 2 items • Updated 16 days ago • 12

Eagle 2

Eagle 2 is a family of frontier vision-language models with vision-centric design. The model supports 4K HD input, long-context video, and grounding. • 9 items • Updated 14 days ago • 30

upvoted a paper 17 days ago

DeepSeek-V3 Technical Report

Paper • 2412.19437 • Published Dec 27, 2024 • 49

upvoted a collection 17 days ago

DeepSeek-V3

3 items • Updated Jan 6 • 177

upvoted a collection 22 days ago

InternLM3

6 items • Updated 20 days ago • 21

upvoted a paper 27 days ago

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Paper • 2501.05366 • Published 28 days ago • 90

upvoted a collection 27 days ago

TACO Models

This collection contains the best-performing TACO models based on LLaMA-3/Qwen2 and SigLIP/CLIP. • 3 items • Updated Dec 20, 2024 • 8

upvoted a collection 28 days ago

KaLM-embedding

8 items • Updated about 15 hours ago • 22

upvoted a collection 29 days ago

Phi-4

Phi-4 small language model. • 2 items • Updated 29 days ago • 46

upvoted 2 collections about 1 month ago

Dolphin 3.0

Dolphin 3.0 is the next generation of the Dolphin series of instruct-tuned models. Designed to be the ultimate general purpose local model. • 7 items • Updated Jan 5 • 63

QVQ

QVQ: Qwen models for visual reasoning • 7 items • Updated Jan 1 • 43

upvoted a collection about 2 months ago

ModernBERT

Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 132