Edwin Santiago Alférez Baquero's picture

3 7 140

Edwin Santiago Alférez Baquero PRO

esab

AI & ML interests

None yet

Recent Activity

liked a model 3 days ago

Qwen/Qwen2.5-VL-7B-Instruct

reacted to merve's post with 🔥 3 days ago

This week in open AI was 🔥 Let's recap! 🤗 https://huggingface.co/collections/merve/january-31-releases-679a10669bd4030090c5de4d LLMs 💬 > Huge: AllenAI released new Tülu models that outperform DeepSeek R1 using Reinforcement Learning with Verifiable Reward (RLVR) based on Llama 3.1 405B 🔥 > Mistral AI is back to open-source with their "small" 24B models (base & SFT), with Apache 2.0 license 😱 > Alibaba Qwen released their 1M context length models Qwen2.5-Instruct-1M, great for agentic use with Apache 2.0 license 🔥 > Arcee AI released Virtuoso-medium, 32.8B LLMs distilled from DeepSeek V3 with dataset of 5B+ tokens > Velvet-14B is a new family of 14B Italian LLMs trained on 10T tokens in six languages > OpenThinker-7B is fine-tuned version of Qwen2.5-7B-Instruct on OpenThoughts dataset VLMs & vision 👀 > Alibaba Qwen is back with Qwen2.5VL, amazing new capabilities ranging from agentic computer use to zero-shot localization 🔥 > NVIDIA released new series of Eagle2 models with 1B and 9B sizes > DeepSeek released Janus-Pro, new any-to-any model (image-text generation from image-text input) with MIT license > BEN2 is a new background removal model with MIT license! Audio 🗣️ > YuE is a new open-source music generation foundation model, lyrics-to-song generation Codebase 👩🏻‍💻 > We are open-sourcing our SmolVLM training and eval codebase! https://github.com/huggingface/smollm/tree/main/vision > Open-R1 is open-source reproduction of R1 by @huggingface science team https://huggingface.co/blog/open-r1

liked a model 10 days ago

deepseek-ai/Janus-Pro-7B

View all activity

Organizations

esab's activity

liked a model 3 days ago

Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • Updated about 17 hours ago • 307k • 331

reacted to merve's post with 🔥 3 days ago

Post

3718

This week in open AI was 🔥 Let's recap! 🤗 merve/january-31-releases-679a10669bd4030090c5de4d
LLMs 💬
> Huge: AllenAI released new Tülu models that outperform DeepSeek R1 using Reinforcement Learning with Verifiable Reward (RLVR) based on Llama 3.1 405B 🔥
> Mistral AI is back to open-source with their "small" 24B models (base & SFT), with Apache 2.0 license 😱
> Alibaba Qwen released their 1M context length models Qwen2.5-Instruct-1M, great for agentic use with Apache 2.0 license 🔥
> Arcee AI released Virtuoso-medium, 32.8B LLMs distilled from DeepSeek V3 with dataset of 5B+ tokens
> Velvet-14B is a new family of 14B Italian LLMs trained on 10T tokens in six languages
> OpenThinker-7B is fine-tuned version of Qwen2.5-7B-Instruct on OpenThoughts dataset

VLMs & vision 👀
> Alibaba Qwen is back with Qwen2.5VL, amazing new capabilities ranging from agentic computer use to zero-shot localization 🔥
> NVIDIA released new series of Eagle2 models with 1B and 9B sizes
> DeepSeek released Janus-Pro, new any-to-any model (image-text generation from image-text input) with MIT license
> BEN2 is a new background removal model with MIT license!

Audio 🗣️
> YuE is a new open-source music generation foundation model, lyrics-to-song generation

Codebase 👩🏻‍💻
> We are open-sourcing our SmolVLM training and eval codebase! https://github.com/huggingface/smollm/tree/main/vision
> Open-R1 is open-source reproduction of R1 by @huggingface science team https://huggingface.co/blog/open-r1

1 reply

liked a model 10 days ago

deepseek-ai/Janus-Pro-7B

Any-to-Any • Updated 6 days ago • 223k • 2.67k

liked a model 13 days ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated 6 days ago • 1.54M • • 7.28k

liked a Space 14 days ago

177

ClearerVoice-Studio (Speech Enhancement, Separation and Extraction)

📈

Better AI powered platform to purify your speech signal

liked 3 models about 1 month ago

Alibaba-NLP/gte-Qwen2-1.5B-instruct

knowledgator/modern-gliner-bi-base-v1.0

Token Classification • Updated about 1 month ago • 229 • 24

answerdotai/ModernBERT-base

Fill-Mask • Updated 22 days ago • 8.44M • 722

liked 2 models about 2 months ago

urchade/gliner_multi-v2.1

Token Classification • Updated Apr 10, 2024 • 42.7k • 110

knowledgator/gliner-multitask-v1.0

Token Classification • Updated Dec 10, 2024 • 388 • 30

upvoted a collection 2 months ago

Dec 6 Releases 🎄

Collection

28 items • Updated Dec 9, 2024 • 10

liked a model 2 months ago

PleIAs/celadon

Text Classification • Updated Nov 3, 2024 • 192 • 28

liked a Space 2 months ago

Github Issue Generator

🧑

Generate structured GitHub issues

reacted to merve's post with ❤️ 3 months ago

Post

1518

Apple released AIMv2 🍏 a family of state-of-the-art open-set vision encoders
apple/aimv2-6720fe1558d94c7805f7688c
> like CLIP, but add a decoder and train on autoregression 🤯
> 19 open models come in 300M, 600M, 1.2B, 2.7B with resolutions of 224, 336, 448
> Load and use with 🤗 transformers

reacted to davidberenstein1957's post with 😎 3 months ago

Post

1992

For anyone who struggles with NER or information extraction with LLM.

We showed an efficient workflow for token classification including zero-shot suggestions and model fine-tuning with Argilla, GliNER, the NuMind NuExtract LLM and SpanMarker. @argilla

Video: https://youtu.be/JvLpaYgNd84?feature=shared
Notebooks and slides included to try it yourself 🙂