Lucas Atkins (Crystalcareai)
AI & ML interests: MoE models & dataset curation
Recent Activity
updated a model 8 days ago: arcee-ai/Virtuoso-Lite
reacted to sometimesanotion's post with 👍 9 days ago
**Update** Either I had some wrong numbers plugged into the comparator when estimating benchmark scores, or the benchmark changed. Virtuoso Small v2 at a 41.07 average is still very impressive, especially for drafting business copy, while Lamarck remains a chatty generalist-reasoning model.
I've felt confident that 14B Qwen finetunes and merges could break the 42.0 average, and Arcee **came close** with https://huggingface.co/arcee-ai/Virtuoso-Small-2. Congratulations to @arcee-ai!
Just two months ago, it was easy to think that 14B had plateaued: that you could have a high IFEval score or high MuSR/MATH/GPQA scores at 14B, but not both. That barrier is completely shattered. I see a pathway to even better, and Virtuoso Small 2 is a big part of why. Very impressive work. This community would expect no less from Arcee.
Just look at this graph! Keep in mind, my merges here build on the first Virtuoso Small, and the *-DS merges build on DeepSeek R1. There are some impressive merges in the pipeline!
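For readers unfamiliar with the averages being quoted: the Open LLM Leaderboard's headline number is the mean of six (normalized) per-benchmark scores (IFEval, BBH, MATH, GPQA, MuSR, MMLU-PRO), which is why a model can be strong on instruction following yet dragged down elsewhere. A minimal sketch of the arithmetic, with hypothetical per-benchmark values chosen only for illustration, not Virtuoso Small v2's actual results:

```python
# Hypothetical per-benchmark scores (0-100 scale), for illustration only.
# The leaderboard's headline average is the arithmetic mean of the six.
scores = {
    "IFEval": 82.0,
    "BBH": 50.0,
    "MATH": 40.0,
    "GPQA": 12.0,
    "MuSR": 15.0,
    "MMLU-PRO": 47.0,
}

average = sum(scores.values()) / len(scores)
print(f"Leaderboard average: {average:.2f}")  # -> Leaderboard average: 41.00
```

Note how a single weak benchmark (GPQA or MuSR here) caps the average even when IFEval is high; that is the tradeoff the post says 14B models used to be stuck with.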
reacted to sometimesanotion's post with 🚀 9 days ago
Crystalcareai's activity
Add Apache-2.0 license :3
#1 opened 13 days ago by SaisExperiments

This cross-architecture distillation, with Phi?
#14 opened 26 days ago by sometimesanotion (2 replies)

Context size
#6 opened 2 months ago by LoSboccacc (1 reply)

Chat Template
#5 opened 2 months ago by isr431 (2 replies)

Question about model's origin
#2 opened 2 months ago by sometimesanotion (2 replies)

Fix tokenizer.json with file from Qwen/Qwen2.5-14B
#3 opened 2 months ago by MariusNocturnum (1 reply)

use the original Qwen2.5-14B-Instruct tokenizer
#4 opened 2 months ago by MaziyarPanahi (1 reply)
Adding Evaluation Results
#1 opened 2 months ago by leaderboard-pr-bot
add Instruct datasets and base model
#1 opened 2 months ago by Crystalcareai
Run on multiple GPUs at the same time to improve performance
#1 opened 2 months ago by sieudd (1 reply)
Update SuperNova-Medius with a merge with Qwen/Qwen2.5-Coder-14B-Instruct + Further Training 😋
#12 opened 3 months ago by Joseph717171 (11 replies)

max output tokens?
#11 opened 3 months ago by sirus (1 reply)
Update _name to "arcee-ai/SuperNova-Medius"
#5 opened 4 months ago by ggbetz

Adding Evaluation Results
#3 opened 4 months ago by leaderboard-pr-bot
llama.cpp convert problem report (about `tokenizer.json`)
#2 opened 4 months ago by DataSoul (6 replies)
2 base models = a nice merge UI on the model page
#1 opened 4 months ago by victor (2 replies)
Explain these Benchmark Results
#2 opened 4 months ago by Joseph717171 (2 replies)

Distill Llama-3.2-1B-Instruct from Llama-405B-Instruct to make SuperNova-Pico
#14 opened 5 months ago by Joseph717171 (1 reply)

Adding Evaluation Results
#15 opened 4 months ago by CombinHorizon

Why is the tokenizer.json not the same as LLaMa-3.1-8B-Instruct
#6 opened 5 months ago by Joseph717171 (1 reply)