Cosmo's picture

Cosmo

cosmojg

·

https://cosmo.red

AI & ML interests

Machine learning and computational neuroscience

Recent Activity

liked a model about 22 hours ago

deepseek-ai/deepseek-vl2-tiny

liked a model about 22 hours ago

deepseek-ai/deepseek-vl2-small

liked a model about 22 hours ago

deepseek-ai/deepseek-vl2

View all activity

Organizations

None yet

cosmojg's activity

upvoted a paper about 22 hours ago

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Paper • 2412.10302 • Published Dec 13, 2024 • 14

upvoted a collection about 22 hours ago

DeepSeek-VL2

5 items • Updated 1 day ago • 52

upvoted an article 1 day ago

Article

Open-source DeepResearch – Freeing our search agents

3 days ago

• 648

upvoted a paper 6 days ago

Atla Selene Mini: A General Purpose Evaluation Model

Paper • 2501.17195 • Published 10 days ago • 30

upvoted a collection 6 days ago

Selene-1-Mini

11 items • Updated 3 days ago • 7

upvoted an article 7 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

10 days ago

• 648

upvoted an article 10 days ago

Article

Mastering Long Contexts in LLMs with KVPress

By

and 1 other •

14 days ago

• 59

upvoted a collection 10 days ago

SmolVLM 256M & 500M

Collection for models & demos for even smoller SmolVLM release • 12 items • Updated 14 days ago • 65

upvoted an article 10 days ago

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

15 days ago

• 119

upvoted a collection 10 days ago

DeepSeek-R1

8 items • Updated 17 days ago • 420

upvoted 2 collections 14 days ago

InternVL2.5

Better than InternVL 2.0 • 18 items • Updated 27 days ago • 81

Llama 3.3

This collection hosts the transformers and original repos of the Llama 3.3 • 1 item • Updated Dec 6, 2024 • 127

upvoted a collection 16 days ago

FuseO1-Preview

System-II Reasoning Fusion of LLMs • 10 items • Updated 6 days ago • 17

upvoted 2 articles 17 days ago

Article

TTS Arena: Benchmarking Text-to-Speech Models in the Wild

Feb 27, 2024

• 49

Article

The SOTA Text-to-speech and Zero Shot Voice cloning model that no one knows about...

By

•

17 days ago

• 56

upvoted a paper 20 days ago

OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking

Paper • 2501.09751 • Published 21 days ago • 47

upvoted an article 22 days ago

Article

Diving into MiniMax01 405B MoE

By

•

22 days ago

• 17

upvoted a paper 22 days ago

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers

Paper • 2410.10629 • Published Oct 14, 2024 • 11

upvoted 2 collections 28 days ago

Deepseek V3 (All Versions)

Deepseek V3 - available in bf16, original, and GGUF formats, with support for 2, 3, 4, 5, 6 and 8-bit quantized versions. • 3 items • Updated 2 days ago • 29

Cosmos

The collection of Cosmos models • 31 items • Updated 20 days ago • 254