Sthenno (sthenno)

AI & ML interests

To contact me: [email protected]

Recent Activity

- liked a model about 5 hours ago: Sakalti/Saka-14B
- liked a model 1 day ago: sometimesanotion/KytheraMix-7B-v0.2
- liked a dataset 1 day ago: open-thoughts/OpenThoughts-114k

Organizations

MLX Community · Hugging Face Discord Community · sthenno-com

sthenno's activity

reacted to sometimesanotion's post with 👍 5 days ago
**Update** Either I had some wrong numbers plugged in when estimating benchmark numbers from the comparator, or the benchmark changed. Virtuoso Small v2 at a 41.07 average is still very impressive, especially for writing draft copy for business purposes, while Lamarck remains a chatty generalist-reasoning model.

I've felt confident that 14B Qwen finetunes and merges could break the 42.0 average, and Arcee **came close** with https://huggingface.co/arcee-ai/Virtuoso-Small-2. Congratulations to @arcee-ai!

Just two months ago, it was easy to think that 14B had plateaued: that you could have high IFEval or high MUSR/MATH/GPQA at 14B, but not both. That barrier is completely shattered. I see a pathway to even better results, and Virtuoso Small 2 is a big part of why. Very impressive work. This community would expect no less from Arcee.

Just look at this graph! Keep in mind, my merges here build on the first Virtuoso Small, and *-DS merges build on DeepSeek R1. There are some impressive merges in the pipe!
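For context on the averages quoted above (41.07, the 42.0 barrier), here is a minimal sketch of the arithmetic, assuming the Open LLM Leaderboard-style average is a plain mean over the six normalized benchmark scores (IFEval, BBH, MATH, GPQA, MUSR, MMLU-PRO). The scores in the example are made-up placeholders, not any model's actual results:

```python
# Sketch of a leaderboard-style average, assuming a plain mean over
# six normalized benchmark scores on a 0-100 scale. Placeholder data only.

BENCHMARKS = ["IFEval", "BBH", "MATH", "GPQA", "MUSR", "MMLU-PRO"]

def leaderboard_average(scores: dict[str, float]) -> float:
    """Mean of the six normalized benchmark scores (0-100 scale)."""
    missing = set(BENCHMARKS) - scores.keys()
    if missing:
        raise ValueError(f"missing benchmark scores: {missing}")
    return sum(scores[b] for b in BENCHMARKS) / len(BENCHMARKS)

# Hypothetical scores for illustration only:
example = {
    "IFEval": 72.0, "BBH": 48.0, "MATH": 38.0,
    "GPQA": 16.0, "MUSR": 19.0, "MMLU-PRO": 53.0,
}
print(f"average = {leaderboard_average(example):.2f}")  # -> average = 41.00
```

Since each benchmark carries 1/6 of the weight, a 3-point gain on a single benchmark moves the average by only 0.5, which is why crossing 42.0 is a meaningful jump.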
replied to sometimesanotion's post 5 days ago

Congratulations as well! When I first saw the evaluation results for Virtuoso-Small-2, I quickly abandoned the release of "miscii-14b-0130". Although BBH and IFEval were once strengths of the miscii series, I admit that, within my limited technical capabilities, I was indeed beaten by @arcee-ai ;)

reacted to sometimesanotion's post with 🚀 5 days ago (same post as quoted above)